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15 GOVERNMENT LICENSE RIGHTS STATEMENT 

The U.S. Government has a paid-up license in this 
invention and the right in limited circumstances to 
require the patent owner to license others on 
reasonable terms, as provided for by the terms of 

20 Grant No. GM16549-01 awarded by the National Institutes 
of Health. 



BACKGROUND OF THE INVENTION 

25 

Field of the Invention 

The present invention relates to heregulin 
variants, nucleic acid molecules encoding such 
variants, and related vectors, host cells, 
30 pharmaceutical compositions, and methods. 
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In particular, the invention relates to amino acid 
substitution variants of human heregulin-/3l having an 
enhanced affinity for the ErbB-3 and ErbB-4 receptors. 

5 Description of the Related Art 

Transduction of signals that regulate cell growth 
and differentiation is regulated in part by 
phosphorylation of various cellular proteins. Protein 
tyrosine kinases are enzymes that catalyze this 

10 process. Receptor protein tyrosine kinases are 
believed to direct cellular growth via ligand- 
stimulated tyrosine phosphorylation of intracellular 
proteins. Growth factor receptor protein tyrosine 
kinases of the class I subfamily include the 170 

15 kilodalton (kDa) epidermal growth factor receptor 
(EGFR) encoded by the erbBl gene. erbBl has been 
causally implicated in human malignancy. In 
particular, increased expression of this gene has been 
observed in more aggressive carcinomas of the breast, 

2 0 bladder, lung, and stomach. 

The second member of the class I subfamily, pies' 36 " 
(also called the ErbB-2 receptor or plSS 1 * 5 * 2 ) , was 
originally identified as the product of the 
transforming gene from neuroblastomas of chemically 

25 treated rats. The neu (erJbB2 or HER2 ) gene encodes a 
185 kDa receptor protein tyrosine kinase. 

Amplification and/or overexpression of the human 
erJbB2 gene correlates with a poor prognosis in breast 
and ovarian cancers. Slamon et al . , Science 235:177-82 

30 (1987); Slamon et al . , Science 244:707-12 (1989). 

Overexpression of erbB2 has been correlated with other 
' carcirromas including carcinomas of the stomach, 

endometrium, salivary gland, lung, kidney, colon and 
bladder. Accordingly, in U.S. Patent No. 4,968,603, 

35 Slamon et al . describe and claim various diagnostic 
assays for determining erJbB2 gene amplification or 
expression in tumor cells. Slamon et al . discovered 
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that the presence of multiple copies of the erJbB2 
oncogene in tumor cells indicates that the disease is 
more likely to spread beyond the primary tumor site, 
and »that the disease may therefore require more 
5 aggressive treatment than might otherwise be indicated 
by other diagnostic factors. Slamon et al . conclude 
that the erJbB2 gene amplification test, together with 
the determination of lymph node status, provides 
greatly improved prognostic utility. 

10 ' A further related gene, called erbB3 (or HER3 ) , 

which encodes the ErbB-3 receptor (plSO^ 3 ) has also 
been described. See U.S. Patent No. 5,183,884; Kraus 
et al., PNAS USA 86:9193-97 (1989); EP Patent 
Application No. 4 4 4,961A1; Kraus et al . , PNAS USA 

15 90:2900-04 (1993). Kraus et al . (1989) discovered that 
markedly elevated levels of erJbB3 mRNA were present in 
certain human mammary tumor cell lines indicating that 
erJbB3, like erbBl* and erbB2 , may play a role in human 
malignancies. Also', Kraus et al . (1993) showed that 

20 EGF-dependent activation of the ErbB-3 catalytic domain 
of a chimeric EGFR/ErbB-3 receptor resulted in a 
proliferative response in transfected NIH-3T3 cells. 
Furthermore, these researchers demonstrated that some 
human mammary tumor cell lines display a significant 

25 elevation of steady-state ErbB-3 receptor tyrosine 

phosphorylation, further implicating this receptor in 
human malignancies. The role of erJbB3 in cancer has 
been explored by others, and this gene has been found 
to be overexpressed in breast (Lemoine et al . , Br. J . 

30 Cancer 66 : 1116-21 [1992]), gastrointestinal (Poller et 
al., J. Pathol. 168:275-80 [1992]; Rajkumer et al . , J\ 
4 Pathol*. 170:27^-78 [1993]; Sanidas et al . , Int. J. 
Cancer 54:935-40 [1993]), and pancreatic cancers 
(Lemoine et al . , J. Pathol. 168:269-73 [1992], and 

35 Friess et al . , Clinical Cancer Research 1:1413-20 
[1995] ) . 
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The class I subfamily of growth factor receptor 
protein tyrosine kinases has been further extended to 
include the ErbB-4 (HER4) receptor, which is the 
product of the erbB4 (HER4) gene. ^ee EP Patent 
5 Application No. 599,274; Plowman et al., PNAS USA 

90:1746-50 (1993); and Plowman et al . , Nature 366:473- 
75 (1993). Plowman et al . found that increased erbB4 
expression closely correlated with certain carcinomas 
of epithelial origin, including breast adenocarcinomas. 
10 Diagnostic methods for detection of human neoplastic 
conditions (especially breast cancers) that evaluate 
erJbB4 expression are described in EP Patent Application 
No. 599,274. 

The quest for the activator of the erbB2 oncogene 

15 has lead to the discovery of a family of heregulin 

polypeptides. In humans, the heregulin polypeptides 
characterized thus far are derived from alternate 
splicing of a single gene which was mapped to the short 
arm of chromosome 8 * by Lee and Wood, Genomics 16:790-91 

20 (1993) . 

Holmes et al . isolated and cloned a family of 
polypeptide activators for the ErbB-2 receptor which 
they called heregulin-a (HRG-a) , heregulin-/3l (HRG-/31) , 
heregulin-02 (HRG-02) , and heregulin-/33 (HRG-03). 

25 See Holmes et al . , Science 256:1205-10 (1992); 

WO 92/20798; and U.S. Patent No. 5,367,060. These 
researchers demonstrated the ability of the purified 
heregulin polypeptides to activate tyrosine 
phosphorylation of the ErbB-2 receptor in MCF7 breast 

30 tumor cells. Furthermore, the mitogenic activity of 
the heregulin polypeptides on SK-BR-3 cells (which 
* express high levels of the ErbB-2 receptor) was also 
demonstrated . 

Heregulins are large multi -domain proteins that 

35 are typically expressed as "pro-heregulins . " Pro- 
heregulins have been shown to undergo proteolytic 
processing to a mature soluble form (usually of about 
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44-45 kDa) . Processing has been shown to occur 
intracellu^arly or at the cell surface. Domains in the 
soluble form include (in order from the N- to the 
C- terminus) an immunoglobulin homology (Ig r like) 
5 domain, a spacer region rich in glycosylat ion sites, 
and a domain similar to a domain found in EGF that is 
sufficient for ErbB receptor binding and activation. 
See Barbacci, et al . , J . Biol. Chem. 270:9585-89 
(1995) . 

10 The heregulin EGF- like domains are characterized 

by substantial structural similarities to (Jacobsen et 
al., Biochemistry 35 : 3402-17 [1996]), and limited 
sequence homology with, EGF residues 1-48 (Holmes, et 
al . , supra) . Functional similarities between the 

15 heregulin EGF - like domains and EGF have been 

established by data showing that blocks of EGF sequence 
substituted into heregulin-/?l do not impair binding to 
cells co-expressing ErbB-3 and ErbB-2. Barbacci et 
al . , supra. . 

20 While heregulins are substantially identical in 

the first 213 amino acid residues, they are classified 
into two major types, a and /3, based on two EGF-like 
domains that differ in their C-terminal portions. For 
example, the heregulin-a EGF-like domain differs from 

25 that of the 01-isoform by nine substitutions near the 
C-terminus. The /S-isoform has been reported to bind 
ErbB receptors with approximately eight to 10 -fold 
higher affinity than the a-isoform. Wen et al . , Mol . 
Cell. Biol. 14:1909-19 (1994). 

30 The solution structure of the heregulin-Of EGF 

domain has recently been determined at high resolution 
> by NMR Jacobsen et al . , supra; Nagata et al . , EMBO J. 
13, 3517-3523 (1994) . The salient features of this 
domain include (1) an N- terminal subdomain containing a 

35 central three -stranded 0-sheet with an intermittent 
helix and (2) a smaller C-terminal subdomain that 
contains a short stretch of /3-sheet. The EGF domain is 
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stabilized by three disulfide bonds, two in the 
N- terminal subdomain and one in the C- terminal 
subdomain. The pairing of the six corresponding 
cysteine residues is conserved in EGF-like domains from 
5 all heregulins and from EGF . 

The 44 kDa neu differentiation factor (NDF) , which 
is the rat equivalent of human HRG, was first described 
by Peles et al . , Cell, 69:205-16 (1992), and Wen et 
al., Cell, 69:559-72 (1992). Like the human heregulin 

10 polypeptides, NDF has an Ig-like domain followed by an 
EGF-like domain and lacks a N-terminal signal peptide. 
Subsequently, Wen et al . carried out "exhaustive 
cloning" to extend the family of NDFs . Wen et al . , 
Mol. Cell. Biol. t 14:1909-19 (1994). This work 

15 revealed six distinct fibroblastic pro-NDFs. Adopting 
the nomenclature of Holmes et al . , the NDFs were 
classified as either a or (3 polypeptides based on the 
sequences of the EGF-like domains. Isoforms 1 to 4 are 
characterized on the basis of a variable region between 

20 the EGF-like domain and transmembrane domain. Also, 
isoforms a, b and c are defined based on variable- 
length cytoplasmic domains. These researchers conclude 
that different NDF isoforms are generated by 
alternative splicing and perform distinct tissue- 

25 specific functions. See also EP 505 148; WO 93/22424; 
and WO 94/28133 (discussing NDF) . 

Falls et al . , Cell 72:801-815 (1993) describe 
another member of the heregulin family which they call 
"acetylcholine receptor inducing activity (ARIA) 

30 polypeptide." The chicken-derived ARIA polypeptide 

stimulates synthesis of muscle acetylcholine receptors. 
* See WO-94/08007. ARIA is a /3-type heregulin and lacks 
the entire spacer region between the Ig-like domain and 
EGF-like domain of HRG -a and HRG01-/33- 

35 Marchionni et al . , Nature 362:312-318 (1993) 

identified several bovine -derived proteins that they 
call "glial growth factors (GGFs) . " These GGFs share 
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the Ig-like domain and EGF-like domain with the other 
heregulin protein^ described above, but also have an 
amino- terminal kringle domain. GGFs generally do not 
have the complete spacer region between the Ig-like 
5 domain and EGF-like domain. Only one of the GGFs, 
GGFII, has an N-terminal signal peptide. See also 
WO 92/18627; WO 94/00140; WO 94/04560; WO 94/26298; 
WO 95/32724 (describing GGFs and uses thereof) . 

Ho et al . describe another member of the heregulin 
10 family called "sensory and motor neuron-derived factor 
(SMDF) . 11 Ho et al., J. Biol. Chew. 270:14523-32 
(1995) . This protein has an EGF-like domain 
characteristic of all other heregulin polypeptides but 
a distinct N-terminal domain. In addition, SMDF lacks 
15 both the Ig-like domain and the spacer region found in 
other heregulin polypeptides. Another feature of SMDF 
is the presence of two stretches of hydrophobic amino 
acids near the N-terminus. 

While the hereguiin polypeptides were first 
20 identified based on their ability to activate the 

ErbB-2 receptor (see Holmes et al . , supra) , it has been 
discovered that certain ovarian cells expressing neu 
(erbB2) and neu- transf ected fibroblasts did not bind or 
crosslink to NDF, nor did they undergo tyrosine 
25 phosphorylation in response to NDF. Peles et al . , EMBO 
J. 12:961-71 (1993). This finding indicated that 
another cellular component was necessary for conferring 
full heregulin responsiveness. 

Carraway et al . subsequently demonstrated that 
30 125 I-rHRG-/3l 177-244 bound to NIH-3T3 fibroblasts stably 
transfected with bovine erJbB3 but not to non- 
4 transfected parental cells. These researchers also 
expressed bovine ErbB-3 receptor in insect cells and 
showed that HRG-01 177-244 bound to a preparation of 
35 ErbB-3 receptor solubilized from these cells. They 

concluded that ErbB-3 is a receptor for heregulin and 
mediates phosphorylation of intrinsic tyrosine residues 
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as well as phosphorylation of ErbB-2 receptor in cells 
that express both receptors. Carraway et al . , J. Biol. 
Chem. 269:14303-06 (1994). Sliwkowski et al . found 
that cells transfected with erJbB3 alone show low 
5 affinities for heregulin, whereas cells transfected 
with both erbB2 and erbB3 show higher affinities. 
Sliwkowski et al . , J. Biol. Chem. 26 9:14661-65 (1994). 

Plowman and his colleagues have similarly studied 
ErbB-4/ErbB-2 receptor activation. They expressed the 

10 ErbB2 receptor alone, the ErbB4 receptor alone, or the 
two receptors together in human T lymphocytes and 
demonstrated that heregulin is capable of stimulating 
tyrosine phosphorylation of ErbB-4, but could only 
stimulate ErbB-2 phosphorylation in cells expressing 

15 both receptors. Plowman et al . , Nature 336:473-75 
(1993) . 

These observations are consistent with the 
"receptor cross - talking " concept described previously 
by Kokai et al . , Cell* 58 : 287-92 (1989), Stern et al . , 

20 EMBO J. 7:995-1001 (1988), and King et al . , 4:13-18 

(1989) . These researchers found that binding of EGF to 
the EGFR resulted in activation of the EGFR kinase 
domain and cross -phosphorylation of the ErbB-2 
receptor. This is believed to be a result of ligand- 

25 induced receptor heterodimerization and the concomitant 
cross -phosphorylat ion of the receptors within the 
heterodimer. Wada et al . , Cell 61:1339-47 (1990). 

Thus, the ErbB receptors are believed to be 
activated by ligand- induced receptor dimerization . 

30 Specifically, heregulins can bind separately to ErbB-3 
and ErbB-4 receptors, but not to the ErbB-2 receptor. * 
# However, ErbB-2 is required for signalling, and 
heterodimers containing ErbB-2 in combination with 
ErbB-3 or ErbB-4 bind heregulins with higher affinity 

35 than homodimers containing ErbB-3 or ErbB-4. Plowman 
et al., Nature 366":473-75 (1993); Sliwkowski et al . , 
J. Biol. Chem. 2^9:14661-65 (1994). 
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The biological activities of heregulins have been 
investigated by several groups. For example, Holmes 
et al . (supra) found that heregulin exerts a mitogenic 
effect on mammary cell lines (such as SK-BR-3 and 
5 MCF-7) . Lewis et al . reported that heregulin-/3l 

stimulated proliferation and enhanced colony formation 
in soft agar in a number of human breast and ovarian 
tumor cell lines. Lewis et al . , Cancer Research 
56:1457-65 (1996). These researchers also showed that 

10 ErbB-2 is a critical mediator of heregulin 
responsiveness . 

Pinkas-Kramarski et al . found that NDF (rat 
heregulin) is expressed in neurons and glial cells in 
embryonic and adult rat brain and primary cultures of 

15 rat brain cells, and suggested that NDF may act as a 

survival and maturation factor for astrocytes. Pinkas- 
Kramarski et al . , PNAS USA 91:9387-91 (1994). 
Danilenko et al . reported that the interaction of NDF 
and the ErbB-2 receptor is important in directing 

20 epidermal migration and differentiation during wound 
repair. Danilenko et al . , Abstract 3101, FASEB 
8 (4-5) :A535 (1994) . 

Meyer and Birchmeier analyzed expression of mouse 
heregulin during embryogenesis and in the perinatal 

25 animal using in situ hybridization and RNase protection 
experiments. Meyer and Birchmeier, PNAS USA 91:1064-68 
(1994) . These authors conclude, based on expression of 
this molecule, that heregulin plays a role in vivo as a 
mesenchymal and neuronal factor. Their findings also 

30 indicated that heregulin functions in the development 
of epithelia. _ 

Falls et al. (supra) found that chicken ARIA plays 
a role in myotube differentiation, namely affecting the 
synthesis and concentration of neurotransmitter 

35 receptors in the postsynaptic muscle cells of motor 

neurons. Corf as and Fischbach demonstrated that ARIA 
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also increases the number of sodium channels in chick 
muscle. Corf as and Fischbach, J". Neuroscience 
13 :2118-25 (1993) . 

Bovine GGFs have been reported to be mitogenic for 
5 Schwann cells. See, e.g., Brockes et al . , «J. Biol. 
Che/7?. 255:8374-77 (1980); Lemke and Brockes, 
J". Neurosci. 4:75-83 (1984); Brockes et al . , 
J . Neuroscience 4:75-83 (1984); Brockes et al . , Ann. 
Neurol. 20:317-22 (1986); Brockes, Methods in Enzym. 

10 147:217-225 (1987); Marchionni et al . , supra. Schwann 
cells provide myelin sheathing around the axons of 
myelinated neurons and thus play an important role in 
the development, function and regeneration of 
peripheral nerves. The implications of this role from 

15 a therapeutic standpoint have been addressed by Levi et 
al. , J. Neuroscience 14:1309-19 (1994). Levi et al . 
discussed the potential for construction of a cellular 
prosthesis including Schwann cells that could be 
transplanted into areas of damaged spinal cord. 

20 Methods for culturing Schwann cells ex v^vo have been 

described. See WO 94/00140; Li et al . , J\ Neuroscience 
16 : 2012-19 (1996) . 

GGFII has been shown to be mitogenic for 
subconfluent quiescent human myoblasts, and 

25 differentiation of clonal human myoblasts in the 

continuous presence of GGFII results in greater numbers 
of myotubes after six days of differentiation. Sklar 
et al., J . Cell Biochem. , Abst . W462, 18D, 540 (1994); 
see also WO 94/26298. 

30 The relationship between the structure and 

function of new proteins can be investigated using any 
of a variety of available mutational analysis 
techniques. Examples of such techniques include 
alanine scanning mutagenesis and phagemid display. 

35 Alanine scanning can be used to identify active 

residues (i.e., residues that have a significant effect 
on protein function) in a protein or protein domain. 
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For example, Cunningham and Wells used alanine scanning 
to identify residi'°.s in human growth hormone that were 
important for binding its receptor. Cunningham and 
Wells, Science 244:1081-85 (1989). In alanine 
5 scanning, a gene encoding the protein or domain to be 
scanned is inserted into an expression vector, and 
mutagenesis is carried out to generate a series of 
vectors that encode proteins or domains in which 
sequential residues are converted to alanine. The 

10 encoded proteins or domain are expressed from these 

vectors, and the activities of the alanine - substituted 
variants are then tested to identify those with altered 
activity. An alteration in activity indicates that the 
residue at the alanine-substituted position is an 

15 active residue. 

Phagemid display was developed to allow the 
screening of a large number of variant polypeptides for 
a particular binding activity. Smith and Parmley 
demonstrated that foreign peptides can be "displayed" 

20 efficiently on the surface of filamentous phage by 

inserting short gene fragments into gene III of the fd 
phage. Smith, Science 228:1315-17 (1985); Parmley and 
Smith, Gene 73:305-18 (1985). The gene III coat 
protein is present in about five copies at one end of 

25 the phage particle. The modified phage were termed 
"fusion phage" because they displayed the foreign 
peptides fused to the gene III coat protein. As each 
fusion phage particle displayed approximately five 
copies of the fusion protein, this mode of phage 

30 display was termed "polyvalent display." 

Scott et al . and Cwirla et al . showed that fusion 
* phage libraries co:. :d be screened by sequential 

affinity selections known as "panning." Scott et al . , 
Science 249:386-90 (1990); Cwirla et al . , PNAS USA 

35 87:6378-82 (1990). However, early efforts to select 
high affinity fusion phage failed, presumably due to 
the polyvalence of the phage particles. This problem 
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was solved with the development of a "monovalent" phage 
display system in which the fusion protein is expressed 
at a low level from a phagemid and a helper phage 
provides a large excess of wild- type coat protein. 
Bass et al . , Proteins 8:309-14 (1990); Lowman et al . , 
Biochem. 30:10832-38 (1991). Monovalent phage display 
can be used to generate and screen a large number of 
variant polypeptides to isolate those that bind with 
high affinity to a target of interest. 



SUMMARY OF THE INVENTION 
The present invention provides a heregulin 
variant having an amino acid sequence not found in 
nature and the ability to bind an ErbB receptor. In 
15 one embodiment, the variant has an amino acid 

substitution at a selected residue corresponding to a 
residue of 645-amino acid native human heregulin-/3l 
selected from the 'group consisting of : 

S177, H178, L179,' V180, K181, E184, E186, 
20 K187, T188, V191, N192, G193, G194, 'E195, 

M198, V199, K200, D201, N204, P205, S206, 
R207, Y208, L209, K211, P213, N214, E21S, 
T217, G218, D219, Q222 , N223 , Y224, S228, and 
F229 . 

25 In a variation of this embodiment, the amino acid 
substitution is not a replacement of the selected 
residue with an epidermal growth factor (EGF) residue 
corresponding to the selected residue. 

The heregulin variant can be a variant of any 
3 0 member of the heregulin family from any species. In 

one embodiment, the heregulin variant is a variant of a 
' human heregulin, such as, for example, human 
heregulin-/?l. The invention provides a human 
heregulin-/?l variant including an amino acid 
35 substitution selected from the group consisting of: 
S177W; H178S, E, R, or A; V180Q, I, or E; 
K181P or A; A183G; E184V, W, K, R, G, or N; 
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K185E, S, Q, or G; E186R; K187E or A; T188Q; 
E195Q; F197Y; M198R or K; K200R; D201T or I; 
P205T or Y; S206K, H, G, P, or R; R207Y; 
Y208R or L; L209M or G; K211R; P213S, T, N, 
5 or K; N214L, K, S, or E; F216M; N223H or W; 

and M226I . 

In a variation of this embodiment, the heregulin 
variant includes sets of amino acid substitutions 
selected from this group. Some heregulin variants of 

10 the invention having sets of amino acid substitutions 
exhibit at least a 50-fold increase in ErbB-3 receptor 
affinity, which is also accompanied by an increase in 
ErbB-4 receptor affinity. 

The invention also includes a heregulin variant 

15 that has a greater specificity for the ErbB-4 receptor, 
relative to the ErbB-3 receptor, than the heregulin 
from which the heregulin variant is derived. In one 
embodiment, this heregulin variant has an amino acid 
substitution at a selected residue corresponding to a 

20 residue of 645-amino acid native human heregulin-01 

selected from the group consisting of H178, L.179, and 
R207 . 

In another embodiment, a heregulin variant having 
a greater specificity for the ErbB-4 receptor, relative 
25 to the ErbB-3 receptor, has a deletion of amino acid 
residues corresponding to residues S228 to K231 of 
645-amino acid native human heregulin-/Sl and a 
substitution of a single methionine for the deleted 
residues . 

30 In addition to including one or more of the 

mutations disclosed herein, the heregulin variant can ' 
' have one or more other modifications, such as an amino 
acid substitution, an insertion of at least one amino 
acid, a deletion of at least one amino acid, or a 

35 chemical modification. For example, the invention 

provides a heregulin variant that is a fragment. In a 
variation of this embodiment, the fragment includes 
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residues corresponding to a portion of human 
heregulin-^l extending from about residue 175 to about 
residue 230 (i.e., the EGF-like domain). In a further 
variation of this embodiment, the fragment includes 
5 residues corresponding to a portion of human 

heregulin-/3l extending from about residue 175 to about 
res idue 24 5 . 

One aspect of the invention is a method for 
producing such a modified heregulin variant. The 
10 modification (s) is selected so that the modified 
heregulin variant retains the ability to an ErbB 
receptor . 

In addition to a heregulin variant, the invention 
provides a related nucleic acid molecule, vector, and 

15 host cell . The invention also provides a method of 
producing a heregulin variant in which a host cell 
containing an expression vector capable of expressing 
the heregulin variant is cultured under conditions that 
allow expression of the heregulin variant, and the 

20 heregulin variant then recovered from the culture. 

Other aspects of the invention relate to various 
uses of a heregulin variant. For example, the 
invention provides a method for activating an ErbB 
receptor in which the heregulin variant is contacted 

25 with a cell that expresses an ErbB receptor. The 
heregulin variant can be contacted with cells in 
culture, for example, to promote 

ex vivo survival, proliferation, or differentiation of 
cells, such as glial, Schwann, or muscle cells. 

30 Alternatively, the heregulin variant can be 

combined with a pharmaceut ically acceptable carrier an'd 
t used to treat one of a wide range of cancers as well as 
diseases and disorders affecting the nervous system, 
musculature, and epithelia. Thus, the present 

35 invention provides a pharmaceutical composition and a 
treatment method. 
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The invention also includes a method of 
determining whether a sample contains an ErbB receptor 
that binds a heregulin. In particular, a heregulin 
variant is contacted with a sample, and specific 
5 binding between the heregulin variant and a component 
of the sample is determined as an indication of the 
presence and/or amount of ErbB receptor (s) present in 
the sample. 

10 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows an alignment between the amino acid 
sequences in the EGF-like domains of a number of 
heregulin polypeptides and the amino acid sequence of 
the human heregul in- 01 EGF-like domain ("betal") . The 

15 aligned sequences are as follows: human heregulin-a 

(alpha) , human heregul in- (32 and -03 <beta2 and beta3) , 
neu differentiation factors a2 and bl to b4 (ndfa2 and 
ndfbl-4), glial growth factor II (ggf ) , sensory and 
motor neuron-derived factor (smdf ) , human heregul in--y , 

20 and acetylcholine receptor inducing activity 

polypeptide (aria) . The number shown for each sequence 
is the residue number of the first amino acid shown, as 
numbered from the N- terminus of the native polypeptide 
whose sequence is shown. indicates the differences 

25 between the a- and 0-type EGF-like domains. 

Figure 2 shows the results of an alanine scan of 
the heregulin-/3l EGF-like domain ( heregulin-/3l residues 
177-228) . Individual amino acids in this domain were 
mutated to alanine and displayed monovalently on phage 

30 as gill fusion proteins, as described in Example 2. 

The histogram shows the change in binding affinity of . 
, each alanine variant for ErbB-3 and ErbB-4 receptor-Ig 
fusions (ErbB-3-Ig and ErbB-4-Ig) , as measured by phage 
ELISA. The X axis lists each amino acid that was 

35 changed to alanine and its position. The Y axis is the 
ratio of the EC 50 for each variant to the EC 50 for the 
wild-type heregulin-01 EGF-like domain, also displayed 
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on phage. The EC S0 was calculated as the concentration 
of soluble receptor fusion required to displace 50% of 
the total amount of phage bound to immobilized receptor 
fusion. ErbB-3 binding results are shown with black 
5 bars, and ErbB-4 binding results are shown with white 
bars . 

Figure 3 shows the amino acids selected for 
binding to ErbB-3-Ig at each position in the 
heregulin-/3l EGF-like domain (heregul in- 01 residues 
10 177-228) randomized in the phage display studies 
described in Example 3 . The length of the bars 
indicates the frequency of occurrence of a particular 
amino acid at each position in the variants from phage 
display libraries A-E and G-I for which sequences were 
15 determined (i.e., a longer bar indicates a higher 

frequency) . Twelve clones were sequenced from each 
library, although in library H, only one clone of the 
twelve representee? a variant having mutations in the 
desired randomization -window (see Example 3). »WT" 
20 indicates the wild-type amino acid sequence of the 
heregulin-01 EGF-like domain. 

Figure 4 shows the amino acid substitutions in the 
EGF-like domains of combination variants described in 
Example 3. The amino acid sequence of the wild-type 
25 heregulin-/3l EGF-like domain (HRG8) , a variant of this 
domain containing a deletion of heregul in- 01 residues 
202-204 (HRG63) , and the analogous domain in EGF are 
shown on top. The residue numbering for the portion of 
the heregulin-/3l amino acid sequence shown is indicated 
30 above this sequence (numbered from the N-terminus of 

native human heregul in- (31 ) . The residue numbering for 
, the portion of the EGF amino acid sequence shown is 
indicated below this sequence (numbered from the 
N-terminus of native human EGF). A " . " indicates a 
35 residue that is identical to the wild-type residue at 
the particular position. A " - " indicates the absence 
of a residue. 
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DETAILED DESCRIPTION OF THE INVENTION 
The p^ :sent invention provides heregulin variants 
having one or more amino acid substitutions at selected 
residues. Included within the scope of the invention 
5 are variants of human heregulins, and, in particular, 
variants of human heregul in-/3l . A heregulin variant 
according to the invention can have a single amino acid 
substitution at a selected residue or combinations of 
such substitutions. 

10 In addition to the amino acid substitutions 

specified herein, heregulin variants according to the 
invention can have further modifications, including, 
for example, deletions of amino acids. In one 
embodiment, a heregulin variant has N- and C- terminal 

15 deletions, leaving only amino acids corresponding to 

"the minimal EGF-like domain, " which is sufficient for 
binding and activation of an ErbB receptor. 

A heregulin variant of the invention is capable of 
binding an ErbB receptor, such as ErbB-3 or ErbB-4 . In 

20 one embodiment, the variant has an enhanced affinity 
for an ErbB receptor compared to the affinity of the 
most homologous native heregulin. In addition to ErbB 
receptor binding, the heregulin variant can possess one 
or more other biological activities of a native 

25 heregulin. 

The invention also provides nucleic acid 
molecules, vectors, and host cells related to the 
heregulin variants. A nucleic acid molecule of the 
invention encodes, or is complementary to a nucleic 

30 acid molecule encoding, a heregulin variant of the 
invention or a fragment thereof. The nucleic acid 
* molecuje can be double- or single - stranded DNA or RNA. 
A nucleic acid molecule of the invention can be 
inserted into an appropriate vector for propagation 

35 and/or expression of an encoded heregulin variant. 

Such vectors are introduced into suitable hosts, for 
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example, to allow recombinant production of a heregulin 
variant . 

The heregulin variants of the invention are useful 
in a variety of therapeutic and non- therapeutic 
5 applications. In particular, heregulin variants can be 
used in treating cancer and various diseases and 
disorders of the nervous system, musculature, and 
epithelia. Accordingly, the invention encompasses a 
pharmaceutical composition including a heregulin 

10 variant and related treatment methods. 

Heregulin variants can also be employed in a 
variety of non- therapeutic applications, such as cell 
culture methods and diagnostic methods. For example, 
heregulin variants can be used to promote the ex vivo 

15 survival, proliferation, or differentiation of cells, 
including glial and muscle cells. In an exemplary 
diagnostic application, heregulin variants are employed 
in the diagnosis ( of a cancer characterized by erJbB 
(e.g., erJbB2) overexpression . Accordingly, the 

20 invention also includes kits useful in practicing the 
above -described methods . 

Definitions 

As used herein, the following words or phrases 
25 have the definitions indicated below, unless otherwise 
indicated . 

The terms "amino acid" and "residue" are used 
interchangeably herein . 

The term "wild-type amino acid" or "wild-type 
30 residue" means the amino acid present at a given 
position (s) in a native polypeptide. 
4 Amino acids are denoted herein by the standard 

three-letter or one-letter code. 

Residues in two or more polypeptides are said to 
35 "correspond" if the residues occupy an analogous 

position in the polypeptide structures. As is well 
known in the art, analogous positions in two or more 
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polypeptides can be determined by aligning the 
polypeptide sequrices based on amino acid sequence or 
structural similarities. Those skilled in the art 
understand that it may be necessary to introduce gaps 
5 in either sequence to produce a satisfactory alignment. 
For example, residues in human EGF that correspond to 
residues in human heregulin-/?l are shown in an 
alignment between the amino acid sequence of the 
heregulin-/5l EGF-like domain (heregulin-/3l residues 

10 177-228) and the analogous EGF domain (EGF residues 
1-48) in Figure 4. 

Residues in two or more heregulins are said to 
"correspond" if the residues are aligned in the best 
sequence alignment. The "best sequence alignment" 

15 between two polypeptides is defined as the alignment 
that produces the largest number of aligned identical 
residues. The best sequence alignment for a number of 
heregulin polypeptides is shown in Figure 1. 

Residue positions in heregulin-/3l are designated 

20 herein by the three-letter or one-letter. code for the 
amino acid, followed by the position number, as 
numbered from the N-terminus of native human pro- 
heregul in-/3l (which is 645 amino acids in length) . 
E.g., the serine at position 177 of heregulin-/3l is 

25 denoted "Serl77" or "S177." 

Hereinafter, unless otherwise indicated, residue 
positions in a heregulin, heregulin variant, or related 
protein, such as EGF, are specified herein with 
reference to the amino acid numbering of native human 

30 heregulin-jSl. For example, a heregulin-£l variant can 
have a N-terminal deletion of residues 1-176. The 
^ first amino acid in this variant is identified herein 
as "the residue corresponding to Serl77 of 645-amino 
acid human heregulin-/3l " because the first residue of 

35 the heregulin variant and Serl77 of heregul in- /Si are 
aligned in the best alignment between the two 
polypeptides . 
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Example 3 discloses heregulin-/3l variants 
containing residues corresponding to residues 177 to 
228 of heregulin-01 , which is termed "the minimal 
EGF-like domain." For these variants, residue numbers 
5 also are expressed, in parentheses, in terms of the 

position of the residue in the minimal EGF-like domain 
(hereinafter » heregul in- 01 EGF " or "HRG-01 EGF" ) , i.e., 
residues 1-52. Residue positions numbered according to 
native human heregulin-/3l can be converted to residue 

10 positions in the minimal EGF-like domain by subtracting 
176 from the former position number. E.g., for 
heregulin-/3l Serl77, subtracting 176 from 177 gives 1, 
and thus heregul in-01 EGF Serl identifies the same 
position as heregul in-£l Serl77 . The same numbering 

15 system is used in Example 4 . 

Amino acid substitutions are indicated by listing 
the residue position followed by the code for the amino 
acid substituted into the heregulin polypeptide. Thus, 
a substitution of alanine at Serl77 of heregulin-/3l is 

20 expressed as "heregulin-/3l Serl77Ala, " "Serl77Ala" , or 
"S177A." In this example, serine is the "replaced 
amino acid, " and alanine is the "replacement amino 
acid . " 

As used to describe two amino acid sequences, the 
25 term "homologous" indicates that the amino acid 

sequences have some degree of amino acid sequence 
identity . 

Heregulin Variants 
3 0 The present invention includes a heregulin 

variant. The term "heregulin variant" means a 
^polypeptide variant of a native heregulin. A native 
neregulin is defined as a polypeptide having the full- 
length amino acid sequence of any of the family of 
35 naturally occurring heregulin polypeptides. This 
family encompasses pro-heregulins as well as the 
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soluble forms of these proteins. The invention is 
exemplified with variants of human heregulin-01 . See 
Examples 1-3. However, the heregulin family 
encompasses any naturally occurring polypeptide having 
5 an EGF-like domain that has at least 70 percent 
sequence identity with the EGF-like domain of 
heregulin-/3l when these domains are aligned in the best 
alignment. Thus, a native heregulin can be from any 
species and one of a number of naturally occurring 
10 isoforms or allelic forms. Exemplary heregulin 

polypeptides include neu differentiation factors, glial 
growth factors, sensory and motor neuron-derived 
factor, and acetylcholine receptor inducing activity 
polypeptide . 

15 In 6ne embodiment, the heregulin variant is a 

variant of a mammalian heregulin. In a variation of 
this embodiment, the heregulin variant is a variant of 
a human heregulin. Examples of human heregulins 
include heregulin-a -(HRG-a) , heregulin-01 (HRG-/S1) , 
20 heregulin-02 (HRG-/32) , heregulin-03 (HRG-/S3) , and 
heregulin-y ( HRG - y ) . 

A heregulin variant according to the invention has 
an amino acid sequence not found in nature in which a 
wild-type residue in a native heregulin is replaced 
25 with a different residue. This amino acid substitution 
is at one or more selected residues corresponding to a 
residue of native human heregulin-/3l . The selected 
residue (s) is chosen from the following group: 
S177, H178, L179, V180, K181, E184, E186 , 
30 K187, T188, V191, N192, G193, G194 , E195, 

M198, V199, K200, D201, N204, P205, S206, 
R207, Y208, L209, K211, P213, N214, E215, 
T217, G218, D219, Q222, N223 , Y224, S228, and 
F229 . 

35 In one embodiment, however, the amino acid substitution 
is not a replacement of the selected residue with an 
EGF residue corresponding to the selected residue. 
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Substitution of any of the above residues in 
heregulin-/3l with alanine has at least a two-fold 
effect on affinity for the ErbB-3 or ErbB-4 receptors, 
as determined by phage ELISA. See Example 2. 
5 In one embodiment, the selected residue is chosen 

from the following group: 

S177, H178, L179, E186, K187, T188, V191, 
N192, G193, G194, E195, R207, L209, K211, 
P213, N214, T217, G218, Q222, Y224, and F229. 
10 Substitution of the any of these residues in 

heregulin-/3l with alanine has at least a five-fold 
effect on affinity for the ErbB-3 or ErbB-4 receptors, 
as determined by phage ELISA. See Example 2. 

In a variation of this embodiment, the selected 
15 residue is chosen from the following group: 

H178, L179, K187, N192, G193, G194, E195, 
R207, K211, T217, G218, Q222 , and Y224 . 
Substitution of the any of these residues in 
heregulin-/?! with alanine has at least a 10-fold effect 
20 on affinity for the ErbB-3 or ErbB-4 receptors, as 
determined by phage ELISA. See Example 2. 

Generally, if function is to be preserved at a 
position selected for substitution, the residue used to 
replace the selected residue is not substantially 
25 different in character from the wild- type residue, 
i.e., the amino acid substitution is a conservative 
substitution. Amino acids can be grouped according to 
character as shown in Table 1 . 
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Table 1 
Groups of Amino Acids 
Having Similar Character 



5 Group 
a 
b 
c 
d 

10 e 



Character 
positively charged 
negatively charged 
amide 
aromatic 
hydrophobic 

uncharged 
hydrophilic 



Amino Acids 

Lys, Arg, His 

Asp, Glu 

Asn, Gin 

Phe , Ty r , Trp 

Pro, Gly, 

Ala, Val, 

Leu, lie, Met 

Ser, Thr 



To preserve function, therefore, the residue used to 
replace the wild-type residue is usually selected from 

15 the same group or a related group. In addition, serine 
or alanine can be used to replace most other residues. 
Table 2 shows conservative substitutions for each amino 
acid, identifying related groups for each and 
indicating which amino acids can be replaced with 

20 serine or alanine. Table 2 also snows preferred amino 
acid substitutions. 
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Table 2 

Conservative Amino Acid Subs ti tut ions 



Replacement Amino Preferred 
AA' Acid Selected From Substitutions 





Ala 


e* : Pro, 


Gly, Ala, 


Val, Leu, lie, 


Met 


Ser 






10 




f: Ser, 


Thr 
















Arg 


a : Lys , 


Arg, 


His 






Lys 










Ser, Ala 










Ser, 


Ala 






Asn 


a : Lys , 


Arg, 


His 
















c : Asn , 


Gin 








Gin 






15 




Ser, Ala 










Ser, 


Ala 






Asp 


b: Asp, 


Glu 








Glu 










c: Asn, 


Gin 


















Ser, Ala 










Ser, 


Ala 






Cys 


e: Pro, 


Gly, 


Ala, 


Val, Leu, He, 


Met 


Ala 






20 




f: Ser, 


Thr 








Ser 








Gin 


a : Lys , 


Arg , 


His 
















c: Asn, 


Gin 








Asn 










Ser, Ala 










Ser , 


Ala 






Glu 


b : Asp , 


Glu 








Asp 






25 




c: Asn, 


Gin 


















Ser, Ala 










Ser , 


Ala 






Gly 


e: Pro, 


Gly, 


Ala, 


Val, Leu, He, 


Met 


Pro, 


Ala 








f: Ser, 


Thr 
















His 


a : Lys , 


Arg , 


His 






Arg 






30 




Ser, Ala 










Ser , 


Ala 






He 


e: Pro, 


Gly, 


Ala„ 


Val, Leu, Met 




Ala, 


Val , 


Leu 




Leu 


e: Pro, 


Gly, 


Ala, 


Val, He, Met 




Ala, 


Val, 


He 




Lys 


a : Lys , 


Arg, 


His 






Arg 










Ser, Ala 










Ser , 


Ala 




35 


Met 


e : Pro , 


Gly, 


Ala, 


Val, Leu, He 


, Met 


Ala, 


Val, 


Leu, 




Phe 


a : Lys , 


Arg, 


His 
















d : Phe , 


Tyr, 


Trp 






Tyr 










Ala, Val 


, Leu , 


He 






Ala, 


Val, 


Leu , 




Pro 


a : Lys , 


Arg , 


His 












4 0 




d : Phe , 


Tyr , 


Trp 






Phe 










Gly, Ala 










Gly, 


Ala 






Ser 


a : Lys , 


Arg, 


His 






Thr 










f: Ser, 


Thr 


















Ala 










Ala 






45 


Thr 


a : Lys , 


Arg , 


His 
















f: Ser, 


Thr 








Ser 










Ala 










Ala 








Trp 


d : Phe , 


Tyr , 


Trp 






Phe 










Ala 










Ala 






50 


Tyr 


d : Phe , 


Tyr, 


Trp 






Phe, 










Ala, Val 


, Leu, 


He 






Ala, 


Val, 


Leu, 




Val 


e: Pro, 


Gly, 


Ala, 


Val, Leu, He 


, Met 


Leu , 


He, 








Ser, Ala 








Ser , 


Ala 





55 ^ ' Wild-type amino acid. 

* Lower-case letters rerer to the groups of amino acids in Table 1. 
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10 



15 



20 



25 



30 



35 



In one embodiment, the heregulin variant is a 
variant of l.uman heregulin-/3l and includes an amino 
acid substitution (s) chosen from the following group: 
S177W; H178S, E, R , or A; V180Q, I, or E; 
K181P or A; A183G, T, or D; E184V, W, K, R, 
G, or N; K185E, S, Q, or G; E186R; K187E or 
A; T188Q; E195Q; F197Y; M198R or K; K200R; 
D201T or I; P205T or Y; S206K, H, G, P, or R; 
R207Y; Y208R or L; L209M or G; K211R; P213S, 
T, N, or K; N214L, K , S, or E; F216M; N223H 
or W; and M22 6I . 

Another aspect of the invention is a variant of 
human heregulin-/3l including a set of amino acid 
substitutions, such as any of the heregulin variants 
described in Example 3 . The set of amino acid 
substitutions is chosen from the group indicated below 
(the variant number from Example 3 is shown in the 
left-hand column, f followed by the set of amino acid 
substitutions for that variant) : 



B5 : 


A183G, 


E184W, 


K185D, 


E186R, 


K18.7E, 


T188G, 




M226I ; 












BIO : 


A183D, 


E184K, 


K185S, 


E186R, 


K187E, 


T188G, 




M226I ; 












Dl : 


F197Y, 


M198K, 


K200R, 


D201I , 


M226I ; 




E2 : 


P205Y, 


S206G, 


R207Y, 


Y208L, 


L2 0 9M; 




E3 : 


P205Y, 


S206R, 


R207Y, 


Y208R, 


L209M, 


M226I ; 


E6 : 


P205T, 


S206H, 


R207Y, 


Y208R, 


L2 0 9M; 




E8 : 


P205T, 


S206K, 


R207Y, 


Y208R, 


L209G; 




11 : 


N223W, 


M226I ; 










12 : 


N223H, 


M226I ; 










HRG3 7 : 


S177W, 


H178E, 


K181P, 


A183G, 


E184W, 


K185D, 




E186R, 


K187E, 


T188G, 


M226I ; 






HPG4 8 : 


P205Y, 


S206G, 


R207Y, 


Y208L, 


L209M, 


rI226I; 


HRG53 : 


A183G, 


K185E, 


E186R, 


K187E, 


T188G, 


F197Y, 




M198R, 


D201T; 










HRG54 : 


A183G, 


K185E, 


E186R, 


K187E, 


T188G, 


P205Y, 




S206G, 


R207Y, 


Y208L, 


L209M; 
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HRG55 : 


A183G, 


K185E, 


E186R, 


K187E, 


T188G, 


F197Y, 






M198R, 


D201T, 


P205Y, 


S206G, 


R207Y, 


Y208L,, 






L209M; 














HRG56 : 


A7_83G, 


K185E, 


E186R, 


K187E, 


T188G, 


M226I ; 


5 


HRG5 7 : 


F197Y, 


M198R, 


D201T, 


P205Y, 


S206G, 


R207Y, 






Y208L, 


L2 0 9M; 












HRG5 8 : 


F197Y, 


M198R, 


D201T, 


P205Y, 


S206G, 


R207Y, 






Y208L, 


L2 0 9M, 


M226I ; 










HRG5 9 : 


F197Y/ 


M198R, 


D201T, 


M226I ; 






10 


HRG6 0 : 


A183G, 


K185E, 


E186R, 


K187E, 


T188G, 


F197Y, 






M198R, 


D201T, 


M226I ; 










HRG61 : 


A183G, 


K185E, 


E186R, 


K187E, 


T188G, 


P205Y, 






S206G, 


R207Y, 


Y208L, 


L209M, 


M226I ; 






HRG6 2 : 


A183G, 


K185E, 


E186R, 


K187E, 


T188G, 


F197Y, 


15 




M198R, 


D201T, 


P205Y, 


S206G, 


R207Y, 


Y208L, 






L209M, 


M226I ; 












HRG71 : 


F197Y, 


M198R, 


D201T, 


P205Y, 


S206G, 


R2 0 7Y, 






Y208L, 


.L209M, 


N223H, 


M226I ; 


and 






HRG7 3 : 


A183G, 


K185E, 


E186R, 


K187E, 


T188G, 


F197Y, 


20 




M198R, 


D201T, 


P205Y, 


S206G, 


R207Y, 


Y208L, 






L.209M, 


N223H, 


M226I . 










Each of these se 


ts of amino acid substitutions produce 




at least 


a five- 


fold increase 


in ErbB 


-3 receptor 




affinity, 


as det 


ermined 


by phage ELISA. See 


Example 3 


25 


In a 


variat 


ion of 


this embodiment, the 


set of 




amino acid subst 


itutions is chosen from the 


following 




group : 
















B5 : 


A183G, 


E184W, 


K185D, 


E186R, 


K187E, 


T188G, 






M226I / 












30 


E2 : 


P205Y, 


S206G, 


R207Y, 


Y208L, 


L2 0 9M; 






12 : 


N223H, 


M226I; 












, HRG4 8 : m 


P205Y, 


S206G, 


R207Y, 


Y208L, 


L209M, 


M226I ; 




HRG53 : 


A183G, 


K185E, 


E186R, 


K187E, 


T188G, 


F197Y, 






M198R, 


D201T; 










35 


HRG56 : 


A183G, 


K185E, 


E186R, 


K187E, 


T188G, 


M226I; 




HRG5 7 : 


F197Y, 


M198R, 


D201T, 


P205Y, 


S206G, 


R207Y, 






Y208L, 


L209M; 
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HRG58: F197Y, M198R, D201T, P205Y, S206G, R207Y, 

Y208L, I°09M, M226I; 
HRG5 9 : F197Y, M198R, D201T, M226I; 

HRG60: A183G, K185E, E186R, K187E, T188G, F197Y, 

5 M198R, D201T, M226I; 

HRG61: A183G, K185E, E186R, K187E, T188G, P205Y, 

S206G, R207Y, Y208L, L209M, M226I; 
HRG6 2 : A183G, K185E, E186R, K187E, T188G, F197Y, 

M198R, D201T, P205Y, S206G, R207Y, Y208L, 
10 L209M, M226I; 

HRG71: F197Y, M198R, D201T, P205Y, S206G, R207Y, 

Y208L, L209M, N223H, M226I; and 
HRG73: A183G, K185E, E186R, K187E, T188G, F197Y, 

M198R, D201T, P205Y, S206G, R207Y, Y208L, 
15 L209M, N223H, M226I. 

Each of these sets of amino acid substitutions produces 
at least a 20-fold increase in ErbB-3 receptor 
affinity, as determined by phage ELISA. See Example 3. 
In another variation of this embodiment, the set 
20 of amino acid substitutions is chosen from the 
following group : 

HRG58: F197Y, M198R, D201T, P205Y, S206G, R207Y, 

Y208L, L209M, M226I; 
HRG60: A183G, K185E, E186R, K187E, T188G, F197Y, 

25 M198R, D201T, M226I; 

HRG71: F197Y, M198R, D201T, P205Y, S206G, R207Y, 

Y208L, L209M, N223H, M226I; and 
HRG73: A183G, K185E, E186R, K187E, T188G, F197Y, 

M198R, D201T, P205Y, S206G, R207Y, Y208L, 
30 L209M, N223H, M226I. 

Each of these sets of amino acid substitutions produces 
, at least a 50-fold increase in ErbB-3 receptor 

affinity, as determined by phage ELISA. See Example 3. 
The invention also includes a heregulin variant 
35 that has a greater specificity for the ErbB-4 receptor, 
relative to the ErbB-3 receptor, than the heregulin 
from which the heregulin variant is derived. Such 
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enhanced ErbB-4 receptor specificity can be measured, 
for example, as a lower variant : wild- type EC 50 ratio for 
binding to the ErbB-4 receptor than for binding to 
ErbB-3 receptor. Generally, a heregulin variant having 
5 enhanced ErbB-4 receptor specificity has approximately 
wild-type affinity for the ErbB-4 receptor or better 
together with a significantly reduced (i.e., at least 
about 5-10-fold) affinity for the ErbB-3 receptor. 
In one embodiment, a heregulin variant having 

10 enhanced ErbB-4 receptor specificity has an amino acid 
substitution at a selected residue corresponding to a 
residue of 645-amino acid native human heregulin-01 
selected from the group consisting of H178 , L179, and 
R207. For example, the invention provides a human 

15 heregulin-01 variant including the amino acid 
substitution H178E or R207P. 

In another embodiment, a heregulin variant having 
enhanced ErbB-4 receptor specificity has a deletion of 
amino acid residues * corresponding to residues S228 to 

20 K231 of 645-amino acid native human heregulin-/?l and a 
substitution of a single methionine for the deleted 
residues. In a variation of this embodiment, the 
heregulin variant also includes the amino acid 
substitution H178L. 

25 In addition to the above amino acid substitutions, 

the heregulin variant can optionally have any of the 
following modifications, singly or in combination: 

(1) one or more additional amino acid substitutions; 

(2) one or more amino acids added to the N- or 

30 C-terminus of, or inserted within, the amino acid 
sequence of the heregulin variant; (3) one or more 
A amino acids deleted from the heregulin variant; and 

(4) one or more covalent modifications of an amino acid 
in the heregulin variant. 

35 Thus, the heregulin variant can be "substantially 

full-length," which, as used herein, means that the 
heregulin variant is at least 90% as long as the native 
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heregulin to which the variant is most homologous. 
Alternatively, the heregulin variant can be a 
"fragment" that is less than 90% as long as the most 
homologous native heregulin. Heregulin variants that 
5 are fragments are usually about 30 to about 100 amino 
acids, more usually about 4 0 to about 6 0 amino acids, 
even more usually about 4 5 to about 65 amino acids, and 
most usually about 50 amino acids in length. 

For example, the heregulin variant can include 

10 amino acids corresponding to "the minimal EGF-like 

domain." The minimal EGF-like domain is a portion of a 
native heregulin that is sufficient for binding and 
activation of an ErbB receptor. In general, the 
minimal EGF-like domain is less than about 70 amino 

15 acids and usually less than about 60 amino acids in 
length. As used herein with reference to human 
heregulin-/5l , the minimal EGF-like domain extends from 
residues 177-228. » Unless otherwise indicated, "HRG-/S1 
EGF" refers to the minimal EGF-like domain. 

20 Examples of suitable covalent modifications of a 

heregulin variant according to the invention include, 
but are not limited to, conjugation with a detectable 
label, "pegylation, " and conjugation with a cytotoxic 
agent. A heregulin variant can be conjugated to any of 

25 a wide variety of available labels to produce a 

conjugate useful for detecting the presence of ErbB 
receptors in a sample. Suitable labels include a 
radioisotope, a fluorescent label, and an enzyme label. 
Exemplary radioisotope labels are 35 S, 14 C, 125 I, 3 H, and 

30 131 I . Heregulin variants can be conjugated to 

radioisotopes as described generally in Current 
, Protocols in Immunology Vols. 1 & 2 (Coligen et al . 
ed. , Wiley Publishers) . 

Fluorescent labels suitable for conjugation to a 

35 heregulin variant include a rare earth chelate (a 
europium chelate), fluorescein, rhodamine, dansyl , 
Lissamine, phycoerythrin , and Texas Red, and 
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derivatives thereof. Conjugates can be prepared as 
described, for example, in Current Proto cols in 
Immunology supra . 

Various enzyme - substrate systems are available, 
and U.S. Patent No. 4,275,14 9 provides a review of some 
of these. In general, enzymes useful in such systems 
catalyze a readily detectable chemical alteration of a 
substrate. For example, the enzyme can catalyze a 
color change, which can be measured spectrophoto- 
metrically, or a change in fluorescence or 
chemiluminescence , which can be detected using a 
fluorometer or chemiluminometer , respectively. 
Exemplary enzyme labels include a lucif erase, malate 
dehydrogenase, urease, a peroxidase, alkaline 
phosphatase, /3-galactosidase , glucoamylase , lysozyme , a 
saccharide oxidase, a heterocyclic oxidase, 
lactoperoxidase , microperoxidase , and the like. 
Heregulin variants can be conjugated to enzyme labels 
as described generally in O' Sullivan et al . , Methods In 
Enzym, 73:47-166 (1981), and in Current Protocols in 
Immunology (supra) . Suitable substrates for use with a 
given enzyme label are well known to those skilled in 
the art . 

Another exemplary modification of a heregulin 
variant of the invention is pegylation, which refers to 
the conjugation of one or more polyethylene glycol 
(PEG) groups to the e- amino group (s) of a polypeptide. 
Pegylation may be desired when the heregulin variant is 
intended for pharmaceutical use, as pegylation can 
increase in vivo half -life and/or reduce immunogenici ty 
and potential toxicity of therapeutic proteins. See,, 
e.g., Abuchowski et al . , J. Biol. Chew. 252:3582-86 
(1977) . 

Conjugation of a heregulin variant with a 
cytotoxic agent produces a targeted cytotoxic agent 
that specifically binds cells expressing appropriate 
ErbB receptors on their surface. The term "cytotoxic 
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agent" refers to a substance that inhibits or prevents 
the function of cells and/or causes destruction of 
cells. The term includes, for example, a radioactive 
isotope (e.g., I, Y, Pr) and a chemotherapeut ic agent. 
5 A "chemotherapeutic agent" is defined herein as 

any chemical compound useful in the treatment of 
cancer. The term "cancer" refers to the physiological 
condition in mammals that is characterized by 
unregulated cell growth. Examples of cancer include 

10 but are not limited to, carcinoma, lymphoma, blastoma, 
sarcoma, and leukemia. More particular examples of 
such cancers include squamous cell cancer, small-cell 
lung cancer, non-small cell lung cancer, gastric 
cancer, pancreatic cancer, glial cell tumors such as 

15 glioblastoma and neurofibromatosis, cervical cancer, 

ovarian cancer, liver cancer, bladder cancer, hepatoma, 
breast cancer, colon cancer, colorectal cancer, 
endometrial carcinoma, salivary gland carcinoma, kidney 
cancer, renal cancer prostate cancer, vulval cancer, 

20 thyroid cancer, hepatic carcinoma and various types of 
head and neck cancer. Examples of chemotherapeutic 
agents include Adriamycin, Doxorubicin, 5 -Fluorouracil 
(5-FU) , Cytosine arabinoside (Ara-C) , Cyclophosphamide, 
Thiotepa, Busulfan, Cytoxin, Taxol , Methotrexate, 

25 Cisplatin, Melphalan,, Vinblastine, Bleomycin, 

Etoposide, If osf amide, Mitomycin C, Mitoxantrone , 
Vincristine, VP-16, Vinorelbine, Carboplatin, 
Teniposide, Daunomycin, Carminomycin, Aminopterin, 
Dactinomycin, a Mitomycin, Nicotinamide, an 

30 Esperamicin, Melphalan and any related nitrogen 
mustard, and an endocrine therapeutic (such as 
diethylstilbestrol [DES] , Tamoxifen, a leutinizing 
hormone releasing, hormone - antagonizing drug, a 
progestin, an ant i -progestin , etc.). 

35 In addition to conjugation to a chemical compound, 

any of the above -described heregulin variants can be 
modified by fusion to a heterologous polypeptide to 
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produce a "chimeric heregulin variant." (Chimeric 
heregulin variants are also referred to herein as 
"fusion proteins.) Typically, the heterologous 
polypeptide is fused at the N- or C terminus of the 
5 heregulin variant to preserve the biological activity 
(described further below) of the heregulin variant. 
However, the heterologous polypeptide can also be 
introduced into regions of the heregulin variant that 
are not critical for biological activity. Generally, 

10 chimeric heregulin variants are produced by recombinant 
techniques. Examples of chimeric heregulin variants 
include a heregulin variant fused to a "signal 
sequence, " a "purification handle" and an 
immunoglobulin sequence . 

15 A "signal sequence" is an amino acid sequence that 

directs the secretion of a polypeptide fused thereto 
from a cell expressing the chimeric protein. Thus, 
fusion of a heregulin variant to a signal sequence 
facilitates recombinant production of the heregulin 

20 variant because the chimeric heregulin variant is 

secreted into the host cell culture medium, from which 
the chimeric heregulin variant can be recovered with 
relative ease. 

A suitable signal sequence can be obtained from 

25 any protein that has a signal sequence and is typically 
(but not always) fused to the N-terminus of the 
heregulin variant. DNA encoding prokaryotic signal 
sequences can be obtained, for example, from lamB or 
ompF, MalE, PhoA, and other genes. A convenient 

30 prokaryotic signal sequence for practicing the 

invention is the E . coll heat-stable enterotoxin II 
& (STII) signal sequence. 

A "purification handle" is a poitxon of a 
polypeptide that binds another polypeptide, termed a 

35 "binding partner." The fusion of a purification handle 
to a heregulin variant confers on the variant the 
ability to bind the binding partner, which facilitates 
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purification of the resultant chimeric heregulin 
variant. Generally, the purification handle is 
selected so that the binding partner does not 
substantially cross-react with other components present 
5 in the mixture from which the chimeric heregulin 

variant is to be purified. As used herein, the term 
"does not substantially cross-react" means that the 
affinity of the binding partner for the purification 
handle is at least about 20-fold, usually at least 
10 about 100-fold, more usually at least about 1000-fold, 
any affinity for any other components present in the 
mixture . 

In one embodiment, the purification handle is an 
epitope recognized by an antibody, and the chimeric 

15 heregulin variant is therefore termed an "epitope- 

tagged heregulin variant." Suitable epitopes generally 
have at least five amino acids, usually between about 
10 and about 5 0 am.ino acids, and more usually between 
about 10 and about 30 amino acids. 

20 A chimeric molecule that includes a heregulin 

variant fused to an immunoglobulin sequence is termed 
"a heregulin variant immunoadhesin. " In one 
embodiment, the immunoglobulin sequence is an 
immunoglobulin constant domain. The immunoglobulin 

25 sequence in a heregulin variant immunoadhesin can be 
obtained from IgG a , IgG 2 , IgG 3 or IgG 4 subtypes, IgA, 
IgE, IgD, or IgM. In one embodiment, the 
immunoglobulin sequence is obtained from IgG x or IgG 3 . 
Other examples of chimeric heregulin variants 

30 include heregulin variants fused to thioredoxin, a 
"salvage receptor binding epitope," or a cytotoxic 
^polypeptide. Fusion of a heregulin variant with 
thioredoxin enhances expression and provides a 
purification handle that facilitates purification using 

35 phenylarsine oxide, which can be covalently bound to a 
solid support, such as agarose. (Agarose 
functionalized with phenylarsine is available 
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commercially as Thibond™ Resin from Invitrogen Corp., 
San Diego, CA. ) Exemplary thioredoxin- variant fusion 
proteins are described in Example 2 . 

The te^m "salvage receptor binding epitope" refers 
5 to an epitope of the Fc region of an IgG molecule 

(e.g., IgG x , IgG 2 , IgG 3 , or IgGj that increases the in 
vivo serum half -life of the IgG. Salvage receptor 
binding epitopes suitable for fusion to a heregulin 
variant according the invention include any of the 

10 known salvage receptor binding epitopes. 

The term "cytotoxic polypeptide" refers to a 
polypeptide that inhibits a cellular function or kills 
cells. Cytotoxic polypeptides suitable for fusion to a 
heregulin variant include an enzymat ically active toxin 

15 of bacterial, fungal, plant, or animal origin and 
fragments thereof and an oncogene product /tyrosine 
kinase inhibitor, such as a peptide that inhibits 
binding of a tyrosine kinase to a SH2 - containing 
substrate protein (-see WO 94/07913, for example). 

20 In one embodiment, a chimeric heregulin variant 

includes a heregulin variant fused to an enzyme that 
converts a "prodrug" to an active drug. Typically, the 
"prodrug" is a precursor or derivative form of a 
cytotoxic drug that is less cytotoxic than the drug 

25 itself and is capable of being enzymat ically activated 
or converted to the cytotoxic drug. The prodrugs of 
this invention include, but are not limited to, a 
phosphate-containing prodrug, a thiophosphate- 
containing prodrug, a sul fate -containing prodrug, a 

30 peptide-containing prodrug, a D-amino acid-modified 

prodrug, a glycosylated prodrug, a @- lactam- containing 
prodrug, a phenoxyacetamide -containing prodrug, a 
phenylacetamide-containing prodrug, 5 - f luorocytosine , a 
5-f luorouridine prodrug, and derivatives thereof. 

35 Examples of cytotoxic drugs that can be derivatized to 
produce a prodrug for use in this invention include, 
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but are not limited to, those chemotherapeutic agents 
described above. 

A heregulin variant according to the invention is 
capable of binding an ErbB receptor. The term "ErbB 
5 receptor" refers to any of the mammalian class I 

tyrosine kinase receptors. Examples of such receptors 
include the ErbB-1 receptor (also known as "the EGF 
receptor"), the ErbB-2 receptor (also called "the HER2 
receptor"), the ErbB-3 (or " HER 3 11 ) receptor, and the 

10 ErbB-4 (or "HER4") receptor. The phrase "capable of 
binding" is used to describe a polypeptide that binds 
another polypeptide with a dissociation constant (K d ) of 
at least 1 mM. 

Exemplary heregulin variants that are capable of 

15 binding the ErbB-3 and ErbB-4 receptors are discussed 
above and in the examples. The production of 
additional heregulin variants, having further 
modifications (e.g, additional amino acid 
substitutions, additions, insertions, or deletions, or 

20 covalent modifications) and of chimeric heregulin 
variants is within the level of skill in the art. 

Furthermore, in light of the teachings herein, 
those skilled in the art can design a large number of 
additional variants that preserve the binding activity 

25 of the heregulin variants of the invention. For 

example, a conservative substitution in a noncritical 
residue of a heregulin variant (as identified in 
Example 2) is not expected to significantly alter ErbB 
receptor binding. Moreover, any effects on ErbB 

3 0 receptor binding can readily be determined in a simple 
binding assay, such as those described in Examples 1-3*. 
^ Thus, the invention encompasses all heregulin variants 
having amino acid substitutions at the specific 
positions discussed above, regardless of any additional 

35 modifications that may be present. 

In addition to ErbB receptor binding, a heregulin 
variant of the invention can possess one or more other 
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biological activities of a native heregulin. For 
example, the heregulin variant can also have the 
ability to activate an ErbB receptor. The phrase 
"ability to activate an ErbB receptor" refers to the 
5 ability to cause the intracellular kinase domain of an 
ErbB receptor to phosphorylate tyrosine residues. 
Generally, receptor activation involves binding of a 
heregulin to a receptor complex of two or more ErbB 
receptors (e.g., an ErbB-2/ErbB-3 or ErbB-2 /ErbB- 4 
10 complex) . Receptor binding activates a kinase domain 
of one or more of the receptors, which results in 
phosphorylation of tyrosine residues in one or more of 
the receptors and/or phosphorylation of tyrosine 
residues in additional substrate polypeptides (s) . ErbB 
15 receptor phosphorylation can be quantified using the 

tyrosine phosphorylation assays described in Example 3 . 

Furthermore, a heregulin variant of the invention 
can be capable of , enhancing the survival, 
proliferation, and or -differentiation of cells having 
20 suitable ErbB receptors. The phrase "enhancing 

survival of cells" refers to increasing the period of 
existence of cells, either in vitro or in vivo, 
relative to the period of existence of cells that have 
not been exposed to the heregulin variant ("untreated 
25 cells" ) . 

The expression "enhancing proliferation of cells" 
means increasing the rate or number of mitotic 
divisions, either in vitro or in vivo, relative to 
untreated cells. An increase in cell proliferation in 
30 cell culture can be detected by counting the number of 
cells before and after exposure to the heregulin 
^variant or by microscopic examination of the degree of 
confluency. Cell proliferation can also be quantified 
by measuring 3 H-thymidine uptake by the cells. 
35 The phrase "enhancing differentiation of cells" 

refers to increasing the extent of cell specialization. 
Cell specialization is characterized by the acquisition 
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of one or more characteristics that differ from those 
of the original cells. Thus, the extent of cell 
specialization is typically determined by screening for 
a change in the phenotype of the cell (e.g., 
5 identifying a change in cellular morphology) . 

Exemplary cells that express ErbB receptors, and 
are therefore responsive to heregulins, include SK-BR-3 
cells, glial cells, glioblastoma cells, Schwann cells, 
hepatocytes, epithelial cells, and muscle cells. Glial 

10 cells are derived from the central nervous system and 
include oligodendrocytes and astrocytes. Muscle cells 
expressing ErbB receptors include muscle cell 
precursors (myoblasts) as well as the more specialized 
skeletal, cardiac, and smooth muscle cells. 

15 Other biological activities that a heregulin 

variant of the invention can possess include induction 
of ion channel (e.g. Na* channel) formation; induction 
of acetylcholine receptor synthesis at the 
neuromuscular junction; enhancement of the formation of 

20 a synaptic junction between a neuron and a muscle, 

nerve, or glandular cell; downregulat ion of estrogen 
receptor; and cell internalization (possibly associated 
with nuclear localization) . 

A heregulin variant is produced by any suitable 

25 method, including peptide synthesis and recombinant 

techniques. Generally, recombinant techniques, which 
are described in detail below, are employed for a 
heregulin variant longer than about 50 or amino acids. 
A heregulin variant having enhanced specificity 

30 for the ErbB-4 receptor, relative the ErbB-3 receptor, 
can be prepared by mutagenizing at one or more 
^heregulin residues and selecting variants having 
enhanced ErbB-4 receptor specificity. Generally, a 
residue chosen for mutagenesis in this context is 

35 characterized by a difference in effect on binding to 
the ErbB-4 receptor, as compared the effect on binding 
to the ErbB-3 receptor. Such residues are evident, for 
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instance, from the alanine -scanning data presented in 
Example 2 . 

Variants having enhanced ErbB-4 receptor 
specificity are selected by screening for binding to 
5 ErbB-4 receptor using any suitable screening method, 
such as monovalent phage display, which described in 
Example 4. The results can be improved by subjecting 
variants to " counter- select ion , " which in this case 
entails the removal of variants that bind with high 
10 affinity to the ErbB-3 receptor. Example 4 

demonstrates that counter- select ion against ErbB-3 -Ig 
produces a significant enrichment in variants 
exhibiting greater specificity for ErbB-4-Ig, relative 
to ErbB-3-Ig, than that of wild-type HRG-/31. 

15 

Nucleic Acid Molecules 
The present invention also includes a nucleic acid 
molecule related po the heregulin variant. The term 
"nucleic acid molecule" encompasses single - stranded and 

20 double -stranded DNA molecules, including. genomic DNA, 
cDNA, DNA produced by an amplification reaction (such 
as polymerase chain reaction ["PCR"]), and DNA produced 
by oligonucleotide synthesis, as well as RNA molecules, 
such as mRNA. Genomic DNA can include non- transcribed 

25 and transcribed regions (such as 5' and 3' non-coding 
regions, introns, and heregulin variant coding 
regions) . cDNA and mRNA molecules contain sequences 
corresponding to transcribed regions . 

A nucleic acid molecule according to the invention 

3 0 has a nucleotide sequence not found in nature and 

encodes, or is complementary to a nucleic acid molecule 
4 encoding, a heregulin variant of the invention or a 
fragment thereof. A complementary nucleotide sequence 
is capable of forming Watson-Crick bonds with its 

35 complement, in which adenine pairs with thymine or v 
uracil and guanine pairs with cytosine. A double- 
stranded DNA molecule encodes one of the heregulin 
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variants, whereas a single - stranded DNA or RNA molecule 
is either the coding (sense) strand or the noncoding 
(anti-sense) strand. When the nucleic acid molecule 
encodes (or is complementary to a nucleic acid molecule 
5 encoding) a fragment of a heregulin variant, the 

fragment includes at least one amino acid substitution. 

Because of the redundancy of the genetic code, 
there are a large number of possible nucleic acid 
molecules related to each heregulin variant. More 
10 specifically, because several different codons encode 
the same amino acid, a large number of different 
nucleic acid molecules encode (or are complementary to 
a nucleic acid molecule encoding) the same heregulin 
variant . 

15 Generally, a heregulin variant of the invention is 

produced by mutating a naturally occurring DNA sequence 
to introduce the desired mutations into the heregulin 
variant amino acid sequence. However, it may also be 
advantageous to change one or more codons in a nucleic 

20 acid molecule without altering the encoded amino acid. 

Examples of such "silent mutations" within the scope of 
the present invention include, for example, mutations 
that create or destroy restriction endonuclease sites 
to facilitate construction of a desired vector and 

25 mutations that enhance expression of the encoded 

heregulin variant. Examples of the latter include 
nucleotide substitutions designed to reduce the 
formation of 5' stem and loop structures in the 
transcribed mRNA or to provide codons that are more 

30 readily transcribed by the selected host (e.g., the 
well-known preference codons for E . coli or yeast 
+ expression) . 

A nucleic acid molecule of the invention can be 
incorporated into a vector (as described further below) 

35 or used, for example, as a hybridization probe or an 

amplification primer. A hybridization probe according 
to the present invention is useful for detecting a 
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nucleic acid molecule containing a desired mutation, 
such as, for example, in screening bacterial 
transf ormants to identify clones containing the mutated 
nucleic acid molecule. 
5 Such probes are generally at least about 

20 nucleotides and usually less than two kilobases. 
The probe includes a number of nucleotides that is 
sufficient, under the hybridization conditions used, to 
hybridize with a mutated sequence to be detected and to 

10 be substantially free from hybridization with other 

sequences. Typically, a probe of the present invention 
is at least about 50 nucleotides, and usually about 
100 nucleotides in length. 

An amplification primer according to the invention 

15 can be used in a conventional amplification protocol, 
such as PCR, to detect a nucleic acid molecule 
containing a desired mutation or to produce sufficient 
amounts of such a molecule for sequencing, insertion 
into a vector, etc. * An amplification primer is 

20 typically used as a member of a primer pair, including 
a 5' upstream primer that hybridizes with the 5' end of 
the nucleic acid sequence to be amplified and a 
3' downstream primer that hybridizes with the 
complement of the 3 ' end of the sequence to be 

25 amplified. 

In general, a primer according to the invention 
includes a number of nucleotides that is sufficient, 
under the hybridization conditions used, to hybridize 
with a mutated sequence and to be substantially free 

30 from hybridization with other sequences. The 

specificity of the primer increases with the number of 
^ nucleotides that hybridize with the mutated sequence. 
In addition, specificity is correlated with the 
proportion of residues in the primer that hybridize 

3 5 with the mutated sequence. A primer of the present 
invention generally includes at least about 
15 nucleotides, and usually at least about 
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20 nucleotides. The primer need not exceed about 
3 0 nucleotides, and usually does not exceed about 
25 nucleotides. In one variation of this embodiment, 
the primer includes between about 2 0 and about 
5 25 nucleotides. Generally, the primers should have a T n 
in the range of about 55°C to about 75°C. In practice, 
the T m is usually between about 60°C to about 65°C to 
facilitate amplification under stringent conditions. 

10 Vectors and Host Cells 

A nucleic acid molecule of the present invention 
can be incorporated into a vector for propagation 
and/or expression in a host cell. Such vectors 
typically contain a replication sequence capable of 

15 effecting replication of the vector in a suitable host 
cell (i.e., an origin of replication) as well as 
sequences encoding a selectable marker, such as an 
antibiotic resistance gene. Upon transformation of a 
suitable host, the vector can replicate and function 

20 independently of the host genome or integrate into the 
host genome. Vector design depends, among other 
things, on the intended use and host cell for the 
vector, and the design of a heregulin variant vector 
for a particular use and host cell is within the level 

25 of skill in the art. 

If the vector is intended for expression of a 
heregulin variant, the vector includes one or more 
control sequences capable of effecting and/or enhancing 
the expression of an operably linked heregulin variant 

30 coding sequence. Control sequences that are suitable 
for expression in prokaryotes, for example, include a • 
4 promoter sequence, an operator sequence, and a ribosome 
binding site. Control sequences for expression in 
eukaryotic cells include a promoter, an enhancer, and a 

35 transcription termination sequence (i.e., a 
polyadenylation signal) . 
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The term "operably linked" means that two nucleic 
acid sequences are in a functional relationship with 
one another. For example, a promoter (or enhancer) is 
operably linked to a coding sequence if it effects (or 
5 enhances) the transcription of the sequence. A 

ribosome binding site is operably linked to a coding 
sequence if it is positioned to facilitate translation. 
Operably linked nucleic acid sequences are often 
contiguous, but this is not a requirement- For 
10 example, enhancers need not be contiguous with a coding 
sequence to enhance transcription of the coding 
sequence . 

A heregulin variant expression vector can also 
include other sequences, such as, for example, nucleic 

15 acid sequences encoding a signal sequence or an 
amplifiable gene. As discussed above, a signal 
sequence directs the secretion of a polypeptide fused 
thereto from a cejl expressing the chimeric protein. 
In the expression vector, nucleic acid encoding a 

20 signal sequence is linked to a heregulin . variant coding 
sequence so as to preserve the reading frame of the 
heregulin variant coding sequence. The inclusion of an 
amplifiable gene (e.g., the dihydrof olate reductase 
[DHFR] gene) in a heregulin variant expression vector 

25 allows selection of host cells containing multiple 
copies of the nucleic acid molecule encoding the 
heregulin variant. 

A vector of the present invention is produced by 
linking desired elements by ligation at convenient 

30 restriction sites. If such sites do not exist, 
suitable sites can be introduced by standard 
# mutagenesis (e.g., si te -directed or cassette 
mutagenesis) or synthetic oligonucleotide adaptors or 
linkers can be used in accordance with conventional 

35 practice. 

The present invention also provides a host cell 
containing a vector of this invention. A wide variety 
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of host cells are available for propagation and/or 
expression of vectors. Examples include prokayotic 
cells (such as E . coli and strains of Bacillus , 
Pseudomonas, and other bacteria) , yeast or other fungal 
5 cells, insect cells, plant cells, and phage, as well as 
higher eukaryotic cells (such as Chinese hamster ovary 
cells and other mammalian cells) . Host cells according 
to the invention include cells in culture and cells 
present in live animals, such as transgenic animals. 

10 See U.S. Patent 5,364,934 for more information on 
vectors and host cells suitable for use in the 
recombinant production of a heregulin variant. 

A vector of the present invention is introduced 
into a host cell by any convenient method, which will 

15 vary depending on the vector-host system employed. 

Generally, a vector is introduced into a host cell by 
transformation (also known as " transf ect ion" ) or 
infection with a virus (e.g., phage) bearing the 
vector. If the host cell is a prokaryotic cell (or 

20 other cell having a cell wall) , convenient 

transformation methods include the calcium treatment 
method described by Cohen et al . , PNAS USA 69:2110-14 
(1972) , and the polyethylene glycol method of Chung et 
al., Nuc. Acids. Res. 16:3580 (1988). If a prokaryotic 

25 cell is used as the host and the vector is a phagemid 

vector, the vector can be introduced into the host cell 
by infection, as described in Example 1. Yeast cells 
can be transformed using polyethylene glycol, for 
example, as taught by Hinnen, PNAS U.S.A. 75:1929-33 

30 (1978) . Mammalian cells are conveniently transformed 
using the calcium phosphate precipitation method 
^ described by Graham et al . , Virology 52:546 (1978), and 
Gorman et al . , DNA and Protein Eng. Tech. 2:3-10 
(1990) . However, other known methods for introducing 

35 DNA into host cells, such as nuclear injection, 

electroporation (see Example 1) , and protoplast fusion 
also are suitable for use in the invention. 
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In one embodiment, a host cell containing a 
nucleic acid molecule encoding a heregulin variant is 
produced by homologous recombination, as described in 
WO 91/06667. Briefly, this method involves 
5 transforming a host cell containing an endogenous 

heregulin gene with a homologous recombination vector 
that includes the sequence to be introduced. The 
homologous recombination vector also includes at least 
one sequence of at least about 150 nucleotides in 

10 length that is homologous with an endogenous sequence 
flanking the endogenous -heregulin gene. Suitable 
flanking sequences are readily identified, for example, 
by the method of genomic walking, using a known native 
heregulin nucleic acid sequence as a starting point. 

15 The homologous recombination vector additionally 

includes an amplifiable gene, such as the DHFR gene. 

Transformation is carried out under conditions 
such that the vector integrates into the host cell 
genome by recombination. Cells that integrate the 

20 vector are then cultured under conditions that select 
for amplification of the amplifiable gene. The 
resulting cells are then screened for high levels of 
heregulin variant production. 

2 5 Recombinant Production of Heregulin Variants 

To produce a heregulin variant recombinantly , host 
cells containing a heregulin variant expression vector 
are prepared and cultured under conditions suitable for 
cell growth and for expression of the heregulin 

30 variant. In particular, the culture medium contains 

appropriate nutrients and growth factors for the host . 
^cell employed. The nutrients and growth factors 
required for growth of a selected host cell are, in 
many instances, well known or can be readily determined 

35 empirically by those skilled in the art. Suitable 
culture conditions for mammalian host cells, for 
instance, are described in Mammalian Cell Culture 
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(Mather ed . , Plenum Press 1984), and in Barnes and 
Sato, Cell 22:649 (1980). 

In addition, the culture conditions should allow 
transcription, translation, and protein transport 
5 between cellular compartments. Factors that affect 
these processes are well-known and include, for 
example, DNA/RNA copy number; factors that stabilize 
RNA; nutrients, supplements, and transcriptional 
inducers or repressors present in the culture medium; 

10 temperature, pH, and osmolality of the culture; and cell 
density. The adjustment of these factors to promote 
expression in a particular vector-host cell system is 
within the level of skill in the art . Principles and 
practical techniques for maximizing the productivity of 

15 in vitro mammalian cell cultures, for example, can be 
found in Mammalian Cell Biotechnology : a Practical 
Approach (Butler ed . , IRL Press 1991) . 

The cell culture procedure employed in the 
production of a heregulin variant of the present 

2 0 invention can be any of a number of well-known 

procedures for large- or small-scale production of 
proteins. These include, but are not limited to, the 
use of a fluidized bed bioreactor, a hollow fiber 
bioreactor, a roller bottle culture system, and a 
25 stirred tank bioreactor system. A heregulin variant 
can be produced, for instance, in a batch, fed-batch, 
or continuous mode process. 

Methods for recovery of recombinant proteins 
produced as described above are well-known and vary 

3 0 depending on the expression system employed. For 

example, if, as is typical, the heregulin variant is 
fused to a signal sequence, the heregulin variant is 
recovered from the culture medium or the periplasm. 
Conveniently, the variant is secreted into the 
35 periplasmic space as a mature protein. The heregulin 
variant can also be expressed intracellularly and 
recovered from cell lysates . 
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The heregulin variant can be purified from culture 
medium or a cell lysate by any method capable of 
separating the variant from components of the host cell 
or culture medium. Typically the heregulin variant is 
5 separated from host cell and/or culture medium 

components that would interfere with the intended use 
of the heregulin variant. As a first step, the culture 
medium or cell lysate is usually centrifuged or 
filtered to remove cellular debris. The supernatant is 

10 then typically concentrated or diluted to a desired 
volume or diafiltered into a suitable buffer to 
condition the preparation for further purification. 

The heregulin variant is typically further 
purified in the same manner as the most homologous 

15 native heregulin, taking account of any substantial 
differences in properties between the two molecules. 
For example, if the heregulin variant is an epitope- 
tagged heregulin variant, purification can be carried 
out using an immunoaf f inity column containing antibody 

20 to the epitope tag. The following exemplary procedures 
for purifying heregulins can be used or adapted for 
purifying a heregulin variant of the invention: 
fractionation on an immunoaf f inity column, 
fractionation on an ion-exchange column, ammonium 

25 sulphate or ethanol precipitation, reverse phase HPLC, 
chromatography on silica, chromatography on heparin 
Sepharose, chromatography on a cation exchange resin, 
chromatof ocusing, SDS-PAGE, and gel filtration (e.g., 
using a High Load Superdex 75™ prep grade column) . 

30 If the heregulin variant is expressed initially as 

an insoluble, aggregated form (especially in bacterial 
host cells) , it may be necessary to solubilize and 
renature the heregulin Variant using techniques 
available in the art for solubilizing and renaturing 

35 recombinant protein refractile bodies. See, e.g., x U.S. 
Patent No. 4,511,502. 
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In one variation of this embodiment, the heregulin 
variant is purified (1) to a degree sufficient to 
obtain at least 15 residues, and preferably 
20 residues, of N- terminal or internal amino acid 
5 sequence, using a spinning cup sequenator, or (2) to 

homogeneity by SDS-PAGE under non- reducing or reducing 
conditions using Coomassie blue stain. As used herein, 
"homogeneity" means less than about 5% contamination 
with other source proteins, as determined by staining 
10 with Coomassie blue . 

Utility of Hereaulin Variants 
Generally speaking, heregulin variants according 
no the invention can be used in the same applications 

15 as native heregulins. Of course, some heregulin 
variants within the scope of the invention may be 
better suited for one application than for other 
applications. However, those skilled in the art can 
readily ascertain which heregulin variants are 

20 appropriate for a given application by using one or 
more conventional assays to determine the biological 
activity of the variants. 

Pharmaceutical Compositions and Treatment Methods 
25 Heregulins are useful in treating a wide range of 

diseases and disorders affecting the nervous system, 
musculature, and epithelia. In addition, heregulins 
can be used in the treatment of cancer. As used 
herein, "treatment" encompasses the treatment of an 
30 existing disease or disorder as well as prophylactic 
measures. 

4 Accordingly, the present invention provides a 

pharmaceutical composition including a heregulin 
variant that is useful in treating any of a variety of 
35 diseases or disorders. In one embodiment, a 

pharmaceutical heregulin variant composition is 
employed to treat a mammal. In particular, the 
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composition is useful for treating humans, farm animals 
(e.g., cows and sheep), zoo animals, animals used in 
sports (e.g., horses), and pets (e.g., dogs and cats). 
In a variation of this embodiment, t 1 3 composition is 
5 used to treat a human patient. 

* A heregulin variant according to the invention can 
be useful in promoting the development, maintenance, 
and/or regeneration of a neuron in vivo. Neurons that 
respond to such a variant include central nervous 

10 system (brain and spinal chord) neurons, peripheral 
nervous system neurons (including sympathetic, 
parasympathetic, sensory, and enteric neurons) , and 
motorneurons . Diseases or disorders amenable to 
heregulin variant treatment arise in individuals who 

15 have suffered nervous system damage due, for example, 
to trauma, surgery, stroke, ischemia, infection, 
metabolic disease, nutritional deficiency, malignancy, 
or a toxic agent . , 

A heregulin variant can provide therapeutic 

20 benefits to such individuals by promoting the survival, 
proliferation, or differentiation of neurons. For 
example, a heregulin variant can be used to promote the 
survival or proliferation of motorneurons that have 
been damaged by trauma or surgery. A heregulin variant 

25 can also be employed to treat motoneuron disorders, 
such as amyotrophic lateral sclerosis (Lou Gehrig's 
disease), Bell's palsy, and various conditions 
involving spinal muscular atrophy or paralysis. In 
addition, a heregulin variant can also be useful for 

30 treating a human neurodegenerative disorder, such as 
Alzheimer's disease, Parkinson's disease, epilepsy, 
, multiple sclerosis, Huntington's chorea, Down's 
Syndrome, nerve deafness, and Meniere's disease. 

Furthermore, a heregulin variant of the invention 

35 can be used to treat neuropathy, especially peripheral 
neuropathy. As used herein, the term "peripheral 
neuropathy" refers to a disorder affecting the 
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peripheral nervous system, most often manifested as one 
or a combination of motor, sensory, sensorimotor, or 
autonomic neural dysfunctions. Examples include, but 
are not limited to, distal sensorimotor neuropathy and 
5 autonomic neuropathies, such as reduced motility of the 
gastrointestinal tract or atony of the urinary bladder. 
Peripheral neuropathies amendable to heregulin variant 
treatment can be inherited, can result from a systemic 
disease, or can be- induced by a toxic agent. Examples 

10 of hereditary neuropathies include Charcot -Marie -Tooth 
disease, Re f sum's disease, Abetal ipoproteinemia , 
Tangier disease, Krabbe ' s disease, Metachromatic 
leukodystrophy, Fabry's disease, and Dej erine -Sottas 
syndrome. Examples of neuropathies associated with 

15 systemic disease include post-polio syndrome; and 
examples of neuropathies induced by a toxic agent 
include those caused by treatment with a 
chemotherapeutic agent . 

A heregulin variant can also be employed to 

20 improve neural function. The beneficial .effects of 

heregulin variant treatment are attributed to induction 
of the formation of ion channels in cell membranes and 
enhancement of the formation of synaptic junctions. 

A heregulin variant according to the invention can 

25 also be used to treat muscle cells and medical 

conditions affecting muscle cells. In particular, such 
heregulin variant can be useful for treating muscle 
damage, decreasing atrophy of muscle cells, and 
increasing muscle cell survival, proliferation and/or 

3 0 regeneration. Examples of pathophysiological 

conditions of the musculature amenable to treatment 
with a heregulin variant include skeletal muscle 
direases (e.g., myopathy or dystrophy), cardiac muscle 
disorders (including atrial cardiac arrhythmias, 

3 5 cardiomyopathy, ischemic damage, congenital disease, 

and cardiac trauma) , and smooth muscle disorders (such 
as arterial sclerosis, vascular lesion, or congenital 
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vascular disease) . A heregulin variant can also be 
employed to reduce hypertension and to increase 
functional acetylcholine receptors on muscle cells 
(e.g., in individuals having myasthenia gravis or 
5 tachycardia) . 

A heregulin variant of the invention can also 
enhance repair and/or regeneration of tissues that 
express ErbB receptors, especially ErbB-2 and either 
ErbB-3 or ErbB-4 receptors. Accordingly, a heregulin 

10 variant can be useful for treating dermal wounds, 

gastrointestinal disease, Barrett's esophagus, cystic 
or non-cystic end stage kidney disease, or inflammatory 
bowel disease. A heregulin variant can also be 
employed to promote reepithelial izat ion in the human 

15 gastrointestinal, respiratory, reproductive, or urinary 
tract . 

In addition, a heregulin variant according to the 
invention can be useful for inhibiting tumor cell 
invasion and metastasis. In particular, a tumor 

20 characterized by reduced endogenous heregulin levels 
(Park et al . , Proc . Am. Assoc, Cancer Res. 34:521 
[1993] ) is responsive to treatment with a heregulin 
variant of the invention. Additionally, a tumor that 
overexpresses ErbB receptors can be treated by using a 

25 heregulin variant conjugated to a cytotoxic agent 

(described above) to direct the cytotoxic agent to the 
tumor tissue. A heregulin variant -enzyme conjugate can 
also be employed to target a prodrug (described above) 
therapy to cells expressing ErbB receptors. 

30 A pharmaceutical composition according to the 

invention is prepared for storage by mixing a heregulin 
variant having the desired degree of purity with an 
optional physiologically acceptable carrier, excipient, 
or stabilizer, such as are described in Remington" s 

35 Pharmaceutical Sciences 16th Edition (Osol ed . , 1980). 
The composition can be stored in the form of a 
lyophilized cake or an aqueous solution. A 
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pharmaceut ically acceptable carrier, excipient, or 
stabilizer is non-toxic to recipients at the dosages 
employed, and can include a buffer (such as a phosphate 
buffer, citrate buffer, and buffers made from other 
5 organic acids) , an antioxidant (e.g., ascorbic acid) , a - 
low-molecular weight (less than about 10 residues) 
polypeptide, a protein (such as serum albumin, gelatin, 
and an immunoglobulin) , a hydrophilic polymer (such as 
polyvinylpyrrolidone) , an amino acid (such as glycine, 

10 glutamine, asparagine, arginine, and lysine), a 

monosaccharide, a disaccharide , and other carbohydrates 
(including glucose, mannose, and dextrins) , a chelating 
agent (e.g., ethylenediaminetetratacet ic acid [EDTA] ) , 
a sugar alcohol (such as mannitol and sorbitol) , a 

15 salt-forming counterion (e.g., sodium), and/or a 

nonionic surfactant (such as Tween™, Pluronics™, and 
PEG) ♦ In one embodiment, the physiologically 
acceptable carrier is an aqueous pH-buffered solution. 
A heregulin variant composition intended for in 

20 vivo administration is typically sterile. 

Sterilization is readily accomplished by filtration 
through a sterile filtration membrane. If the 
composition is stored in lyophilized form, the 
composition can be filtered before or after 

25 lyophilization and reconst itut ion . 

A pharmaceutical heregulin variant composition of 
the invention is generally placed into a container 
having a sterile access port, such as, for example, an 
intravenous solution bag or a vial having a stopper 

30 pierceable by a hypodermic injection needle. 

Methods for administering a pharmaceutical 
heregulin variant composition do not differ from known 
methods for administering therapeutic proteins. 
Suitable routes of administration include, for example, 

35 intravenous, intraperitoneal, intracerebral, 
intramuscular, intraocular, intraarterial, or 
intralesional routes. A pharmaceutical heregulin 
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variant composition can be administered continuously by 
infusion or r>y bolus injection. 

If desired, a sustained- release preparation can 
also be used to administer a heregulin variant. An 
5 exemplary sustained-release preparation has a 

semipermeable matrix of a solid hydrophobic polymer to 
which the heregulin variant is attached. Examples of 
suitable polymers include a polyester, a hydrogel, a 
polylactide, a copolymer of L-glutamic acid and 

10 7-ethyl -L-glutamate , non-degradable ethylene -vinyl 
acetate, a degradable lactic acid-glycol ic acid 
copolymer, and poly-D- ( - ) - 3 -hydroxybutyric acid. Such 
matrices are in the form of shaped articles, such as 
films, or microcapsules. 

15 In one embodiment, a sustained- release heregulin 

variant preparation includes a liposomally entrapped 
heregulin variant. Liposomes are small vesicles 
composed of various types of lipids, phospholipids, 
and/or surfactants. These components are typically 

20 arranged in a bilayer formation, similar to the lipid 
arrangement of biological membranes . Liposomes 
containing heregulin variants are prepared by known 
methods, such as, for example, those described in 
Epstein et al . , PNAS USA 82:3688-92 (1985), and Hwang 

25 et al . , PNAS USA 77:4030-34 (1980). Ordinarily the 

liposomes in such preparations are of the small (about 
200-800 Angstroms) unilamellar type in which the lipid 
content is greater than about 30 mol . percent 
cholesterol, the specific percentage being adjusted to 

30 provide the optimal therapy. Useful liposomes can be 
generated by the reverse -phase evaporation method, 
musing a. lipid composition including, for example, 
phosphatidylcholine , cholesterol , and PEG-derivat ized 
phosphatidylethanolamine (PEG-PE) . Liposomes are 

35 extruded through filters of defined pore size to yield 
liposomes with the desired diameter. 
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For treatment of neurologic diseases or disorders, 
a heregulin variant can be adsorbed onto a membrane, 
such as a silastic membrane, which can be implanted in 
proximity to damaged neural tissue, as described in 
5 WO 91/04014 . 

The dosage of a heregulin variant composition to 
be employed therapeutically depends, for example, upon 
the therapeutic objectives, the route of 
administration, and the condition of the patient. 

10 Accordingly, it is necessary for the clinician to titer 
the dosage and modify the route of administration as 
required to obtain the optimal therapeutic effect . A 
typical daily dosage can range from about 1 fig/kg to up 
to 100 mg/kg of body weight or more per' day, but is 

15 typically between about 10 fig/kg /day to 10 mg/kg/day. 

Generally, the clinician begins with a low dosage of a 
pharmaceutical heregulin variant composition and 
increases the dosage until the desired therapeutic 
effect is achieved. 

20 The administration of a heregulin variant of the 

invention can be combined with other therapeutic 
regimens . For the treatment of neurological 
conditions, a heregulin variant is optionally combined, 
or administered in concert, with another neurotrophic 

25 factor to achieve a desired therapeutic effect. For 

example, a heregulin variant can be used together with 
nerve growth factor (NGF) , a neurotrophin (e.g., NT-3, 
-4, or -5), bone-derived nerve factor (BDNF) , an 
insulin-like growth factor (e.g., IGF-1 or IGF-2) , 

3 0 gas6, or another neurotrophic factor to achieve a 

synergistic stimulatory effect on neurons. Suitable 
/ dosages for the neurotrophic factors do not differ from 
those known in the art for such molecules. 

For the treatment of cancer, radiation and/or a 

3 5 chemotherapeutic agent can be administered 

concomitantly with a heregulin variant. Suitable 
preparation and dosing schedules for such 
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chemotherapeutic agents are as recommended by the 
manufacturer or as determined empirically by the 
clinician. For preparation and dosing schedules for 
standard chemotherapeutic agents, see Chemotherapy 
Service (Perry ed . , Williams & Wilkins 1992). 
Administration of the chemotherapeutic agent can 
precede, or follow, administration of the heregulin 
variant, or the chemotherapeutic agent can be given 
simultaneously therewith. Antibodies against tumor- 
associated antigens, such as antibodies that bind EGFR, 
ErbB-2, ErbB-3, or ErbB-4 receptor, or vascular 
endothelial factor (VEGF) can also be co-administered 
with an heregulin variant, as can one or more 
cytokines . 

Non -Therapeutic Methods 

Heregulin variants according to the invention also 
be employed in a variety of non- therapeutic 
applications, such as cell culture methods and 
diagnostic methods. For example, a heregulin variant 
can be used to promote the ex vivo survival, 
proliferation, or differentiation of cells, such as 
glial, Schwann, and muscle cells. Cultures of such 
cells are useful for producing cell - specif ic factors, 
such as, for example, the nerve growth factor receptor 
(P75 NGFR ) which is a Schwann cell - specif ic factor. Cell- 
specific factors can be employed directly as diagnostic 
tools or employed to generate antibodies for diagnostic 
use . 

Ex vivo cell cultures can also be used as cellular 
prostheses for transplantation. For example, Schwann 
cell cultures can be transplanted into areas of damaged 
spinal cord to promote regeneration of interrupted 
central axons or can be used to assist the repair of 
peripheral nerve injuries. 

Accordingly, the present invention provides a cell 
culture method in which heregul in- responsive cells are 
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provided in a suitable cell culture medium. Suitable 
tissue culture media are well known to those skilled in 
the art and include, but are not limited to, Minimal 
Essential Medium (MEM), RPMI-1640, and Dulbecco's 
5 Modified Eagle's Medium (DMEM) . These tissue culture 
media are commercially available from Sigma Chemical 
Company (St. Louis, MO) and GIBCO (Grand Island, NY). 
The cells are cultured in the cell culture medium under 
conditions that permit the cells to grow in the 

10 presence of a heregulin variant. Suitable procedures 
for cell culture do not differ from known procedures 
and include, for example, liquid culture, culture in 
agar, and culture in a clot. 

The cells are cultured in the presence of an 

15 effective amount of a heregulin variant. The amount of 
heregulin variant can vary, depending on the cell type 
and cell culture conditions, but generally is in the 
range of about 10, ng/ml to about 1 mg/ml . An 
appropriate concentration for a given cell culture can 

20 readily be determined empirically by those skilled in 
the art . 

Techniques for culturing Schwann cells ex vivo are 
described in Li et al . (supra), and Sklar et al . 
(supra) describe ex vivo culture of clonal human 
25 myoblasts. A heregulin variant of the invention can 

replace the other heregulin polypeptides used in these 
methods . 

A heregulin variant can also be employed in the 
diagnosis of a cancer characterized by erbB (e.g., 

30 erJbB2) overexpression . In a diagnostic method 

according to the invention, a sample is obtained from • 
an individual and contacted with a heregulin variant 
under conditions that allow specific binding between 
the variant and any ErbB receptors present in the 

35 sample. The sample can be a tissue sample, a bodily 

fluid sample, or a cell. In the case of a solid tumor, 
a tissue sample can be taken from a surgically removed 
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tumor and prepared for testing by conventional 
techniques. In the case of lymphomas and leukemias, a 
sample containing lymphocytes, leukemic cells, or lymph 
tissues is obtained. Other samples, including samples 
5 of urine, tear drops, serum, cerebrospinal fluid, 
feces, sputum, cell extract, and the like, can be 
useful for diagnosing particular tumors. As used with 
regard the this method, the term "specific binding" 
means that the heregulin variant binds an ErbB receptor 

10 with an affinity that is sufficiently high that the 
heregulin variant does not substantially cross-react 
with other components present in the sample under the 
suitable reaction conditions. 

The amount of heregulin variant that specifically 

15 binds to the sample is determined as an indication of 
ErbB receptor content. For example, a tissue sample 
can be obtained from a primary tumor and used to 
prepare formal in- fixed , paraffin-embedded blocks. See 
Muss et al . , supra; 'Press et al . , Cancer Research 

20 54:2771-77 (1994). Tissue sections are then prepared 
according to known techniques. 

A heregulin variant is contacted with a tissue 
section under conditions that permit specific binding 
between the variant and ErbB receptors present in the 

25 section. Binding is generally detected using a label, 
such as a radioisotope, a fluorescent label, or an 
enzyme-substrate labelling system. The label can be 
conjugated directly to the heregulin variant, as 
described above. 

30 Alternatively, the label can be bound to the 

heregulin variant indirectly. For example, the label • 
$ can be ^conjugated to an ant i -heregulin variant antibody 
or conjugated to biotin or avidin and used with an 
anti-heregulin variant antibody conjugated to avidin or 

35 biotin (respectively) , as described generally in 

Current Protocols in Immunology (supra) . Selective 
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binding between biotin and avidin links the label to 
the heregulin variant. 

While in vitro analysis is normally contemplated, 
in vivo analysis using a heregulin variant conjugated 
5 to a suitable detectable label (e.g., In for imaging) 
can also be performed. See, e.g., U.S. Patent 
No. 4,938,948. 

A diagnostic method of the invention can be used 
in combination with other diagnostic/prognostic 

10 evaluations such as determining lymph node status, 
primary tumor size, histologic grade, estrogen or 
progesterone status, tumor DNA content (ploidy) , or 
cell proliferation (S-phrase fraction) . See Muss et 
al . , New Eng. J. Med. 330:1260-66 (1994). 

15 A heregulin variant according to the invention is 

also useful as a standard in assays for heregulins 
(such as a radioimmunoassay, an enzyme - 1 inked 
immunoassay, and a radioreceptor assay) , in an affinity 
purification technique 1 (e.g., for an ErbB receptor such 

20 as ErbB-3 or ErbB-4 receptor) , and in a competitive 

receptor binding assay. A heregulin variant can also 
be employed as an immunogen for generating anti- 
heregulin variant antibodies useful in detection and/or 
purification of heregulin variants. 

25 In addition to the applications discussed above, a 

heregulin variant that has an enhanced specificity for 
the ErbB-4 receptor, relative to the ErbB-3 receptor, 
can be used to bind and stimulate the ErbB-4 receptor 
preferentially over the ErbB-3 receptor. Such a 

30 variant is useful in studies designed to distinguish 

between the presence of ErbB-3 and ErbB-4 receptors in' 
*a sample or to determine whether particular biological 
actions are mediated by one or both receptors. For 
instance, in cell types having a relatively small 

35 number of ErbB-4 receptors and a large number of ErbB-3 
receptors, in the absence of receptor- specif ic 
heregulin variants, it is difficult to identify signal 
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transduction pathways activated by the different 
receptors and to link particular biological actions to 
one or both receptors. A variant that preferentially 
binds to the ErbB-4 receptor can be used to help 
5 elucidate the roles of the ErbB-3 and ErbB-4 receptors 
in heregulin action. 

Diagnostic Kits and Articles of Manufacture 

The present invention also provides kits useful in 

10 practicing the above-described methods. In one 

embodiment, the invention provides a diagnostic kit, 
i.e., a packaged combination of reagents for use in 
testing a sample. The components of the kit are 
typically provided in predetermined ratios. A kit for 

15 detecting ErbB receptors can include, for example, a 
heregulin variant labelled with a suitable label or a 
heregulin variant with a labelled reagent (s) for 
indirect labelling. If the label is an enzyme, the kit 
typically includes any substrate or cof actor required- 

20 by the enzyme label. Other additives, such as 

stabilizers, buffers and the like, can also be included 
in the kit. Kit reagents can be provided as dry 
powders, usually lyophilized, together with excipients 
for preparing kit reagent solutions of the appropriate 

25 concentration. Kits also typically include 

instructions for carrying out the assay for which the 
kit is designed. 

The present invention also provides an article of 
manufacture containing a pharmaceutical heregulin 

30 variant composition useful for the treatment of a 

disorder described above. The article of manufacture * 
, includes a container and a label . Suitable containers 
include, for example, bottles, vials, syringes, and 
test tubes. The container can be made from any of 

35 a variety of materials, such as glass or plastic and 
can have a sterile access port. The label on, or 
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associated with, the container indicates the disorder 
that the composition is to be used to treat. 

The article of manufacture can be a component of a 
kit that includes a second container including a 
5 pharmaceut ically- acceptable buffer, such as phosphate- 
buffered saline, Ringer's solution, or dextrose 
solution. The kit can also include other materials 
that are desirable from a commercial or user 
standpoint, such as other buffers, diluents, filters, 

10 needles, syringes," and package inserts with 
instructions for use. 

The following examples are offered by way of 
illustration and not by way of limitation. The 
disclosures of all citations in the specification are 

15 expressly incorporated herein by reference. 

EXAMPLE 1 
Determination of the Heregul in-(3 1 
Minimal EGF- Like Domain 
20 The smallest portion of HRG-/31 EGF that provides 

high-affinity receptor binding in the context of phage 
display was determined by preparing phagemid vectors 
that produced HRG-jSl 147-227, 147-244, 177-227, or 
177-244 fused to the C-terminus of M13 pill. These 
25 HRG-/31 EGF-like domain fragments were amplified from 

the vector pHL89 (which is described in Holmes, et al . , 
Science 256: 1205-10 [1992]) by PCR with primers having 
Nsil/Xbal-containing overhangs. 

These fragments were inserted into the phagemid 
30 display vector pam-g3 by restriction digest - ligation at 
the same sites to generate constructs pHRGl-g3 (177- . 
,227), pHRG2-g3 (17?-244), pHRG4-g3 (147-227), and 
pHRG5-g3 (147-244) . pam-g3 was a derivative of 
phGHam-g3, which was designed for phage display of 
3 5 human growth hormone (hGH) and was described in Lowman 
et al., Biochemistry 30 : 10832-38 (1991). pam-g3 was 
produced by removing the hGH gene present in phGHam-g3 
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and replacing this gene with a stuff er fragment, which 
provides spce for cleavage at the restriction sites 
used for cloning. 

pHRGl-g3 contained an Ala227Val mutation that had 
5 been introduced in generating the construct. Single- 
stranded uracil -containing template was produced from 
this construct and used in site -directed mutagenesis to 
restore Ala227, generating pHRG6-g3 (177-227) . In each 
of the above-described constructs, the HRG-/31 EGF-like 

10 domain fragment was attached to residue 247 of pill. 

In an effort to determine whether the inclusion 
of an extended flexible linker at the junction between 
HRG-01 EGF and pill would alleviate potential pill 
interference of binding to ErbB-3, two constructs 

15 having linkers at this junction were prepared from the 
pHRGl-g3 template. pHRG8-g3 expressed HRG-/31 177-228 
attached to pill 323 through a linker containing three 
consecutive GGGS (SEQ ID NO: 34) repeats, and pHRGll-g3 
expressed HRG-01 177-230 attached to pill 247 through a 

2 0 GGGSGGG (SEQ ID NO: 35) linkage. 

The HRG-/31 EGF-like domains expressed from the 
above-described constructs are designated herein by 
removing the "p" and the " -g3" that appear in the name 
of the construct. Thus, the HRG-/31 EGF-like domain 

25 expressed from the pHRG2-g3 construct is designated 
"HRG2 . " 

The domains were displayed monovalently on phage 
as pill fusion proteins, as described by Bass et al . , 
Proteins 8:309-14 (1990), and then analyzed for their 

30 binding to the high-affinity ErbB-2/3 receptor-Ig 

fusion (ErbB-2/3 - Ig) using the phage ELISA^ technique . 
described by Cunningham et al . , EMBO J. 13:2508-15 
(1994), with slight modifications. To produce phage 
displaying the domains, mutagenesis reaction mixtures 

35 were electrotransf ormed into XLl-Blue™ cells 

(Stratagene, Inc., La Jolla, CA) , according to the 
manufacturer's protocol. The transformed cells were 
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then infected with 10 11 plaque - forming units (pfu) 
M13K07 helper phage (Promega Corp., Madison, WI). 
Phage stocks (about 10 14 phagemid/mL) were prepared by 
precipitating culture broths from the cells after 
5 18-24 hours (h) of growth with 20% PEG ( 2 0 00 ) /2 . 5 M 

NaCl , according to the method of Sambrook and Maniatis, 
Molecular Cloning a Laboratory Manual Cold Spring 
Harbor Press 429 (1989) . The phage were resuspended in 
phosphate-buffered saline (PBS: 0.01 M sodium 

10 phosphate, 0.1 M NaCl, pH 7.5). 

ErbB receptor- Ig fusions where prepared using 
vectors that expressed the extracellular domain (ECD) 
of the ErbB-2, ErbB-3, or ErbB-4 receptor fused to a 
human IgG constant domain. First, a unique Mlul site 

15 was engineered into' a vector expressing a human IgG 
heavy chain (pDR2) at the region encoding the hinge 
domain of the immunoglobulin. Mlul sites were also 
engineered into a, set of ErbB expression vectors at the 
region encoding the 'EGD/transmembrane junctions of 

20 these receptors. All mutagenesis was performed 

according to the method of Kunkel et al . , Methods 
Enzymol. 154:367-82 (1987). 

The Mlul sites were used to make constructs that 
expressed the desired ErbB receptor- Ig fusion. The 

25 fusion junctions of the various receptor fusions were 
as follows (in order from the N- terminus to the 
C-terminus) : for ErbB-2, Glu646 of ErbB2 was fused to 
Thr-Arg-Asp-Lys-Thr (TRDKT; SEQ ID NO:36), which was 
fused to His224 of VH (H224™) ; for ErbB-3, Leu617 of 

30 ErbB-3 was fused to TRDKT - H 2 2 4 ^ ; for ErbB-4, Gly640 of 
ErbB-4 was fused to TRDKT - H 2 2 4 ^ . The ErbB receptor 
„ residue numbers are indicated according to the 
numbering system of Plowman et al . PNAS USA 90:174 6-50 
(1993) . The conserved TR sequence was derived from the 

35 Mlul site, and the conserved DKT sequence was derived 
from a linker. The final expression constructs had a 
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pRK-type plasmid backbone wherein eukaryotic expression 
was driven by a cyt omega lovirus (CMV) promoter. 

Receptor fusions were expressed from these 
constructs, purified, and allowed to form disulfide- 
5 linked dimers . Homodimeric ErbB-2, ErbB-3 and ErbB-4 

receptor- Ig fusions were produced by transfecting cells 
with the construct encoding the appropriate receptor 
fusion. Heterodimeric receptor fusions were generated 
by co- transfecting two expression vectors encoding 

10 different receptor fusions into the same cells. The 
resulting secreted receptor fusions were mixtures of 
two types of homodimers and the expected heterodimer. 

To express the receptor fusions, adherent HEK-293 
cells (ATCC No. CRL1573) were transfected with the 

15 appropriate expression vector (s) using the calcium 
phosphate precipitation method described by Gorman 
et al., DNA and Protein Eng. Tech. 2:3-10 (1990). 
Serum-containing medium was replaced with serum-free 
medium at 15 h post-transfection, and the transfected 

20 cells were cultured serum-free for 5-7 days. 

The resulting conditioned medium was harvested and 
passed through Protein A columns (1 mL Pharmacia 
HiTrap™, Piscataway, NJ) . Purified receptor fusions 
were eluted with 0.1 M citric acid (pH 4.2) into tubes 

25 containing 1 M Tris-HCl (pH 9.0). The eluted proteins 
were then dialyzed against PBS and concentrated using 
Centri-prep-30™ filters (Amicon, Beverly, MA) . 
Glycerol was added to a final concentration of 25%, and 
the preparations stored at -20°C. The receptor fusion 

30 concentration was determined via a Fc-ELISA. 

Microtiter plates (Nunc Maxisorp™ 96-well plates,. 
Inter Med, Denmark) for phage ELISA were prepared as 
follows. The wells were precoated overnight with 0.5 
fig of rabbit anti -human IgG (Fc gamma 

35 fragment-specific) antibodies (Jackson Immunoresearch, 
West Grove, PA) in 100 fiL* 50 mM NaC0 3 (pH 9.6) . The 
wells were blocked for 30 minutes (min) with 200 fxL PBS 
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containing 0.1% bovine serum albumin (BSA) and rinsed 
with wash buffer (PBS containing 0.05% Tween 2 0™). The 
wells were then coated with 0.1 fig ErbB-2/3-Ig in 
binding buffer (PBS, 0.1% BSA, 0.05% Tween 20™) for 
5 1 h, and washed again. 

Serial dilutions of soluble ErbB-2/3-Ig 
(competitor) and a concentration of phage predetermined 
to give 60% saturation (without competitor) were added 
to the wells in 100 fih binding buffer. Following 

10 incubation for 2 h at room temperature, plates were 

washed extensively and treated with a 1:900 dilution of 
anti-M13 horseradish peroxidase conjugate (Pharmacia, 
Piscataway, NJ) for 20 min. The amount of phage 
binding was determined by assaying horseradish 

15 peroxidase activity using o-phenylenediamine 

dihydrochloride) substrate solution (Sigma Chemical 
Company, St. Louis, MO). EC 50 values were calculated as 
the concentration of soluble ErbB-2/3-Ig required to 
compete half of the phage off the plate. 

20 The results are set forth in Table 3 , 

Table 3 

Initial Heregul in- Phage Variants 



2 5 Construct 



Heregulin-01 
residues 



Linker and pill 
fusion point 



ErbB-2/3-Ig 
EC S0 (nM)' 



HRG2 
HRG4 
HRG5 
HRG6 
3 0 HKG7 
HRG8 
HRG11 



177-244 

147-227 

147-244 

177-227 

177-228 

177-228 (1-52) 

177-230 



pill 247 
pill 247 
pill 247 
pill 247 
pill 247 
(GGGS) 3 -pIII 323 
GGGS GGG - p I I I 247 



6.0 
38 . 0 

4 . 7 
40 . 0' 
42 .0 
11 . 0 
19 , 0 



'Average of duplicate experiments 



35 All phage stocks bound specifically to immobilized 

ErbB-2/3-Ig and could be competed off with similar EC 50 
values (5-42 nM) . These values were about 100-fold 
higher than the previously measured dissociation 
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constant (K^) . Holmes, et al . , supra. However, EC S0 
values obtained from phage ELISA are often higher than 
the true particularly for high-affinity 

interactions. This may be due to the high receptor 
5 coat concentration required to give a reasonable signal 
for the bound phage, a low percentage of active 
receptor in competitor solutions, or interference from 
the linkage of the protein to pill. 

In any event, the HRG-/31 147-176 did not appear to 

10 enhance the binding of phage displaying the HRG-/31 

EGF-like domain to ErbB-3-Ig, whereas HRG-01 228-244 
contributed slightly to binding affinity. Thus, the 
minimal EGF-like domain of HRG-/31 was defined as 
HRG-01 177-228. 

15 Phage ELISA was also carried out for the 

constructs encoding HRG-/31 EGF-pIII fusions containing 
linkers inserted between HRG-/31 EGF and the pill 
fragment. Mild enhancements in binding affinity were 
observed for the 1 inker- containing fusions, along with 

20 increased expression of functional fusions, as 

determined by binding titrations of the phage stocks. 
The fusion of HRG-/31 EGF via a linker to pill 
residue 323, instead of residue 247 

(pHRG8-g3) resulted in a slight affinity enhancement. 
25 This construct was therefore used as the template 

vector for construction of phage display libraries. 

EXAMPLE 2 

Identification of Active Residues in the 
3 0 Heregulin-ffl EGF Domain by Alanine Scanning 

This example describes the identification of 
^active residues in the heregul in- /3l (HRG-/31 ) EGF-like 
domain (HRG-01 177- 1_?9) that play a role in the binding 
of HRG-/31 to the ErbB-3 and ErbB-4 receptors. Active 
35 residues were identified by mutating individual amino 
acids in this domain to alanine. The mutated domains 
(hereinafter "variants") were displayed monovalently on 
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phage as pill fusion proteins and variant affinities 
for ErbB-3 and ErbB-4 were determined by phage EL ISA . 
Selected variants were expressed as thioredoxin fusion 
proteins, which were also assayed for ErbB-3 and ErbB-4 
5 affinity. 

Alanine Scanning Mutagenesis and Phage Display 

Alanine -substituted variants were generated by 
site-directed mutagenesis according to Kunkel et al . , 

10 Methods Enzymbl. 154:367-82 (1987) (hereinafter "Kunkel 
mutagenesis") , using uracil -containing single - stranded 
DNA template prepared from pHRG2-g3. pHRG2-g3, which 
is described in Example 1, expressed HRG-01 177-244 
fused to pill 247. A series of oligonucleotides was 

15 used to generate a series of constructs that expressed 
a series of variants in which consecutive residues were 
mutated to alanine. Phage stocks were prepared from 
these constructs as described in Example 1, except that 
PEG (8000) was used to" precipitate the phage. The 

20 affinities of the alanine-substi tuted variants for 

ErbB-3 -Ig and ErbB-4-Ig was determined by phage ELISA 
as described in Example 1. The results are shown in 
Figure 2, which indicates the ratio of the EC S0 for each 
variant compared to the EC S0 for wild-type HRG-/31 177- 

25 244, also displayed on phage. In this plot, a ratio of 
one indicates that there was no difference in affinity 
for variant binding compared to wild-type HRG-/31 177- 
224, and a ratio of, e.g., five indicates that the 
variant bound the receptor with an affinity five-fold 

30 less than that of wild-type HRG-01 177-224. 

* Expression, Purification, and Assay of Soluble Alanine- 
Substi tuted Variants 

A number of alanine- substituted variants were 
35 expressed in soluble form as thioredoxin (Trx) fusion 
proteins. To prepare suitable expression vectors, a 
Trx expression vector was first generated from pET23a 
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(Novagen, Inc., Madison, WI). pET23a was digested with 
Ndel (which cuts at base 238) and Hindlll (which cuts 
at base 173), and a fragment encoding Trx was inserted. 
This fragment was obtained from pTrxFus (bases 2722- 
5 3180/ Invitrogen Corp., San Diego, CA) . The Ndel site, 
which includes the Trx translation start site, was then 
destroyed by cutting with Ndel and religating with 
Klenow. This removed the Ndel site, while retaining 
the Trx translation start. 

10 Vectors encoding HRG-01 alanine -substituted 

variants were initially generated by Kunkel mutagenesis 
in a pRKS.gDhrgBl vector (described in Gorman et al . , 
DNA Prot. Eng. Tech. 2:2-10 [1990]). The sequences 
encoding the variants could be cleaved from these 

15 vectors using Ndel and BamHI . To facilitate cloning of 
such fragments into the Trx expression vector, Kunkel 
mutagenesis was used to engineer a Kpnl site into the 
pRKS.gDhrgBl vector encoding wild-type HRG-/31 146-244 
immediately upstream of the Ndel site (at base 5407) . 

20 A KpnI-BamHI fragment encoding wild-type ,HRG-/5l 146-224 
was then cleaved from pRK5 . gDhrgBl and inserted to the 
Trx expression vector at Kpnl and BamHI cloning sites 
at the 3' end of the sequence encoding Trx. This 
introduced an Ndel site immediately downstream of the 

25 Kpnl site. In the resultant vector, the wild-type 

HRG-/31 sequence could be removed by digesting with Ndel 
and BamHI and replaced with an Ndel -BamHI fragment 
encoding a variant. The series of Trx-variant 
expression vectors thus obtained expressed Trx-variant 

30 fusions that contained an enterokinase protease 

recognition site ( DDDDK ; SEQ ID NO: 37) between the Trx 
4 and the variant sequences. 

Expression of Trx-variant fusion proteins was 
driven by the inducible T7 promoter from pET23a. 

3 5 Cloning, cell growth, and expression were carried out 
as described in the Novagen pET system manual . 
Briefly, cloning was done in XLl-Blue™ cells 
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(Stratagene, Inc., La Jolla, CA) and expression of 
soluble protein in BL21DE3 host cells (Novagen, Inc., 
Madison, WI) . BL21DE3 cells containing a Trx-variant 
expression vector were grown at 37°C n LB medium until 
5 the OD 550 reached 0.3-0.6. Expression of Trx-variant 
was then induced by addition of 0.4 nM i sopropy 1 - /3 -D - 
thiogalactopyranoside (IPTG) , and growth was allowed to 
continue for 2-4 h at 28°C. Cells were collected by 
centrifugation, resuspended in 0.02 M Tris-HCl, 0.025 M 

10 EDTA (pH 7.5) to a volume that was l/20th the cell 
culture volume. 

Cells were lysed by freezing on dry ice, thawing 
at 37°C, followed by vigorous sonication. The freeze, 
thaw, and sonication cycle was repeated three times. 

15 Protein was further solubilized in 6 M GdHCl , 0.1 M 
Tris-HCl (pH 8.8), sulfitolized by the addition of 
0.1 M Na 2 S0 3 , 0.2 M Na 2 S 4 ^0 6 , and stirred at room 
temperature for 1.5 h. Protein was dialyzed into 
0.05 M Tris-HCl (pH 7.5), 0.01 M methionine. After 

20 dialysis, the insoluble material was removed by 
centrifugation at 35K x g for 15 min . 

The supernatant was purified by Fast Flow Q 
Sepharose™ (Pharmacia, Piscataway, NJ) chromatography 
using a 15 -ml column equilibrated with 0.01 M Tris-HCl 

25 (pH 7.5) . Protein was eluted using a 0-2 M NaCl 

gradient with a flow rate of 5 mL/min. The Trx-variant 
fusions eluted between 0.5-0.6 M NaCl and were refolded 
overnight at room temperature after addition of 1 mM 
cysteine. The resultant preparation was dialyzed into 

30 0.05 M Tris-HCl (pH 7.5), 0.01 M methionine. Trx- 
variant fusions were found to be essentially 
^homogeneous as determined by amino acid analysis and 
SDS-PAGE. 

The affinities of the Trx-variant fusions for ErbB 
35 receptor- Ig fusions were determined by measuring 

inhibition of 125 I-HRG-£l 177-244 binding to ErbB-3-Ig 
and ErbB-4-Ig. Receptor fusions were coated on plates 
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(Nunc Maxisorp C™ break-apart strip wells, Inter Med, 
Denmark) via anti -human IgG, as described in Example 1 
for phage ELISA. Binding assays were carried out with 
a constant amount of 125 I-HRG-/3l 177-214 (100-300 pM) 
5 and varying concentrations (100 pM - 4 /xM) of unlabeled 
Trx- variant fusion. Following incubation for 1-3 h at 
room temperature, plates were washed, and the amount of 
bound 125 I-HRG-/?1 177-244 in each well was counted on a 
gamma counter (Isodata, ICN Biomedic Systems, 

10 Huntsville, AL) . For the ErbB-3 binding assays, the 
blocking buffer was TBST (0.025 M Tris-HCl [pH 7.5], 
0.15 M NaCl, 0.02% Tween 20™) containing 1% BSA; the 
binding buffer was RPMI 1640™ cell culture media 
(Gibco-BRL, Gai thersburg , MD) containing 2 mM 

15 glutamine, 100 U/mL penicillin, 100 izg/mL streptomycin, 
10 mM HEPES buffer (pH 7.2), 0.2% BSA; and the wash 
buffer was TBST. For the ErbB-4 binding assays, PBS 
containing 1% BSA, was used as the blocking and binding 
buffers, and the wash 1 buffer was PBS containing 0.05% 

20 Tween 20™. 

The results are shown in Tables 4 and 5 . 
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Table 4 

EC 50 Values for Phage and Soluble Alanine -Substituted 
Variant Binding to the ErbB-3 Receptor 

5 

Variant' Phage EC C . (nM) Sol EC Crt (nM) 





HRG-01* 


13 


9 




TH-WT® 




47 . 6 


10 


S177 


67 


129 




H178 


>1000 


476 




L179 


-1000 


394 




F189 


892 


975 




N192 


>1000 


1642 


15 


G193 


>1000 


26 . 9 




G194 


500 


1212 




E195 


>1000 


42 




VI 9 9 


27 






K200 


20 


45 . 4 


20 


P205 


22 


39 . 5 




R207 


>1000 


245 




K211 


>1000 


248 




E215 


51 


156 




F216 


500 ~ 


2550 


25 


T217 


>1000 


145 




G218 


>1000 


668 




R220 


>1000 


no binding 




Y224 


865 


339 




F229 


6 


177 



30 

'Variants are identified by the HRG-01 residues mutated 
4 to alanine . 

*HRG~B1 is recombinantly produced HRG-/31 177-244. 

3 5 

®TH-WT is thioredoxin fused at its C-terminus to wild- 
type HRG-/31 146-244. 
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Table 5 

FC 50 Values ror Phage and Soluble Alanine -Subs ti tu ted 



5 


Variant 


Binding to the 


ErhB - 4 


.Receptor 




Variant* 


Phacre EC, n 


(nM) 


Sol'ECc ft (nM) 




HRG-01* 


19 




14 . 8 




TH-WT* 






15 . 4 


10 


S177 


83 




54 . 6 




H178 


32 




138 




L179 


56 




51 . 7 




F189 


106 




565 




N192 


188 




696 


15 


G193 


23 




76 . 3 




G194 


65 




275 




E195 


24 




13 . 2 




VI 9 9 






246 




IX \J \J 


2 5 




16.9 


20 


P205 


8 




34 . 2 




R207 


30 




24 . 3 




K211 


59 




124 




E215 


32 




104 




F216 


>1000 




173 


25 


T217 


14 . 2 




32 . 7 




G218 


>1000 




608 




R220 


>1000 




no binding 




Y224 


51 




24 . 5 




F229 


>1000 




89 . 9 



3 0 ' . 

'Variants are identified by the HRG-/S1 residues mutated 
# to alanine. 

*HRG-B1 is recombinantly produced HRG-01 177-244. 

2 5 

®TH-WT is thioredoxin fused at its C-terminus to wild- 
type HRG-01 146-244. 
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EXAMPLE 3 

Selection of Herecrulin-fll EGF Domain Variants 
Using Monovalent Phaa e Display 

This example describes the selection HRG-01 
variants containing residues corresponding to the 
minimal EGF-like domain (HRG-/51 177-228) . For these 
variants, residue numbers also are expressed, in 
parentheses, in terms of the position of the residue in 
the minimal EGF-like domain (i.e., HRG-01 EGF 1-52). 

Variants of HRG-01 EGF were prepared and selected 
for binding to ErbB-3-Ig using monovalent phage 
display, according to the method of Bass et al . , 
Proteins 8:309-14 (1990). As discussed in detail 
below, an HRG-01 EGF phagemid vector was prepared, in 
which HRG-/31 EGF was fused to- a C- terminal fragment of 
the M13 coat protein pill . Kunkel mutagenesis was 
performed to introduce stop codons into this vector at 
sites selected for ran'domizat ion . This step ensures 
that the starting vector is incapable of expressing the 
wild-type polypeptide. Stretches of four to six 
residues per library were randomized in a linear 
fashion, except for the six cysteines, Phel89 (HRG-/31 
EGF Phel3) and the two most C-terminal residues (see 
Figure 3) . Phel89 was not altered because this residue 
is conserved as an aromatic residue in EGF and TGF-a 
and forms a stacking interaction with.Tyr208 (HRG-/31 
EGF Tyr32) Jacobsen et al . , Biochemistry 
35:3402-17 (1996). HRG-/31 EGF was thus covered in 
eight libraries, designated A-E, G, H and I. 

Library E, covering HRG-/31 202-209 (HRG-/31 
* EGF 26-33), contained a three-residue deletion. The 
deleted region corresponds to a disordered turn between 
the second and third 0-sheet of HRG-01 EGF, and the 
equivalent amino acids are absent in EGF and TGF-a. An 
HRG-/31 EGF control variant in which HRG-01 202-204 
(HRG-01 EGF 26-28) of HRG8 are deleted (HRG63 ) bound 
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ErbB-3-Ig with an affinity similar to that of wild-type 
(Table 13) . 

An additional library (F) was created to randomize 
a surface patch composed of side chains from the first 
5 and second /3-sheets, which included HRG-01 178, 180, 
198, and 200 (HRG-01 EGF 2, 4, 22, and 24). 

The selected sites in the starting vectors were 
randomized by Kunkel mutagenesis to produce 
HRG-01 EGF libraries. Phage displaying mutated HRG-/31 

10 EGFs were produced from the libraries under conditions 
such that, statistically, each phage particle displayed 
no more than one copy of the mutated HRG-/31 EGF. See 
Bass et al . , supra. These phage were then selected for 
binding to (sorted against) ErbB-3-Ig immobilized on an 

15 ELISA plate. Bound phage were eluted and used to 

reinfect host cells, which were used to produce new 
phage for another round of sorting. This process was 
repeated six to seven times for each library. Twelve 
clones from the phage selected from each library were 

20 then sequenced. 

Construction of Phage Libraries 

Phage libraries, were constructed by Kunkel 
mutagenesis using uracil -containing single - stranded DNA 
25 template prepared from pHRG8-g3. pHRG8-g3, which is 
described in Example 1, expressed HRG-/31 177-228 
(HRG-/31 EGF 1-52) fused via a linker to pill 
residue 323. For each library, TAA and TGA stop codons 
were installed at positions selected for randomization 
30 to generate custom templates that eliminated wild-type 
background from the pools. Positions were "fully 
^randomized by mutation to NNS codons (where N is any of 
the four bases and S is either G or C) . One 
oligonucleotide was used for each library mutagenesis 
3 5 reaction except for Library F, for which two 

oligonucleotides (one randomizing HRG-/31 178 and 180 
[HRG-/31 EGF 2 and 4] and the other randomizing HRG0-1 
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198 and 200 [HRG-/31 EGF 22 and 24] ) , were used 
simultaneously . Mutagenesis oligonucleotides contained 
18 -base overhangs on either side of the randomized 
residues . 

5 Mutagenesis reaction mixtures were electro- 

transformed into XL-1 blue cells (Stratagene, Inc., 
La Jolla, CA) , according to the manufacturer's 
protocol. The transformed cells were then infected 
with 10 11 pfu M13K07 helper phage (Promega Corp., 

10 Madison, WI) , and phage stocks (about 10 14 phagemid/mL) 
were prepared as described in Example 1 . 

Between 1.0 x 10 8 and 6.4 x 10 8 transf ormants were 
obtained for each library, meaning that the libraries 
containing five or fewer randomized codons had 

15 excellent representation of the possible amino acid 
sequence combinations (3.36 x 10 7 possible DNA 
sequences; 3.2 x 10 6 possible amino acid sequences). 
Library B, containing six randomized codons (1.1 x 10 9 
DNA sequences), had 4.8 x 10 e total transf ormants . 

20 

Selection of Phage for ErhB-3-Ig Binding 

Monovalent phage were prepared and the selection 
performed on ErbB-3-Ig prebound to microtiter plates 
via capture with polyclonal antibodies to the human Fc 
25 fragment, as described in Example 1. Approximately 10 12 
phage in 100 /iL binding buffer (PBS, 0.1% BSA, 0.05% 
Tween 20™) were applied to an ErbB - 3 - Ig-coated well and 
a control well to which no ErbB-3-Ig had been added. 
Following a 2 h incubation at room temperature, the 
30 plates were washed extensively (12x) and phage eluted 

by adding 100 pL, of a solution of 50 mM HCl and 0.05% ' 
/Tween™ 20 and shaking for 10 min. 

Eluates were neutralized with 10 /iL 1 M Tris-HCl 
(pH 8.0) and 20 /iL used for titration on log-phase XL-1 
35 blue cells. The remainder was used to infect 1 mL of 
log-phase XL-1 blue cells (30 min at 37*C) , which were 
then superinf ected with 2 x 10 10 pfu M13K07 phage and 
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grown in 25 mL 2YT broth (16 g/L tryptone, 10 g/L yeast 
extract, 5 g/L NaCl) containing 50 /ig/mL carbenicillin 
for 18-24 h. Phage were harvested as described above 
and the cycle repeated. The libraries enriched 
5 rapidly, such that by round six of selection the ratio 
of phage eluted from positive (ErbB- 3 - Ig-coated) wells 
to negative (anti-hu Fc precoat only) wells was between 
39 and 9200. 

After round six (libraries A, B, D-F) or round 

10 seven (libraries C, G-I) of selection, twelve clones 

from each library were randomly picked and sequenced by 
the dideoxy method. See Sanger et al . , PNAS USA 
74:5463-67 (1977). The amino acids at the randomized 
positions deduced from the DNA sequences are shown in 

15 Tables 6-12 (a < indicates a residues that is 

identical to the wild-type reside) . The consensus 
selected residues at each position are displayed 
graphically in Figure 3. In general, there were a 
large number of muta'tions, in some cases with dramatic 

20 changes in the character of the side chains. At 

several positions that sorted to a particular residue, 
a mixture of DNA codons was found, providing confidence 
that the libraries had large diversity and that 
selection was at the protein level . In several of the 

25 libraries, there was a spontaneous mutation of HRG-£l 
Met226Ile (HRG-01 EGF MetSOIle) due to a one-base 
change in this codon . This mutation results in a 
significant affinity enhancement for ErbB-3-Ig binding. 
The sequencing results for each library are summarized 

3 0 below. 

, Library^ A - HRG-01 177-181 (HRG-01 EGF 1-5) 

HRG-01 177-181 (HRG-01 EGF 1-5) is present in the 
first /3-strand in the wild- type N-terminal subdomain. 
3 5 The amino acid changes in the variants selected from 
this library are shown in Table 6. 
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Table 6 
Library A Variants 



Position in HRG-/31 





Variant No. 


177 


178 


179 


180 


181 








5 


Wild- type 


S 


H 


L 


V 


K 


(SEQ 


ID 


NO : 3 8 ) 




1 


W 


R 






P 


(SEQ 


ID 


NO : 3 9 ) 




2 


w 


S 




0 


P 


(SEQ 


ID 


NO : 4 0 ) 




3, 5, 10 


w 


E 






P 


(SEQ 


ID 


NO : 4 1 ) 




4 


w 


S 








(SEQ 


ID 


NO : 4 2 ) 


10 


6 


w 


s 




I 


P 


(SEQ 


ID 


NO : 4 3 ) 




7 


w 


R 






A 


(SEQ 


ID 


NO: 44) 




8 


w 


A 






P 


(SEQ 


ID 


NO : 4 5 ) 




9 


w 


S 




Q 




(SEQ 


ID 


NO : 46) 




11 


w 


E 






A 


(SEQ 


ID 


NO : 4 7 ) 


15 


12 


w 


S 




E 


P 


(SEQ 


ID 


NO : 4 8 ) 



Upon randomization, Serl77 (Serl) sorted exclusively to 
Trp. Hisl78 (His2) sorted to mixed hydrophilic 
residues, but the wild-type residue was not among them. 

20 Leul79 (Leu3) was conserved in all variants sequenced. 
Vall80 (Val4) sorted to wild-type in eight out of 
12 variants, and the remaining variants had 
conservative substitutions at this position, with the 
exception of a Vall80Glu (Val4Glu) mutation. At Lysl81 

25 (Lys5) , Pro appeared in eight variants, and the 
wild-type residue was found in two variants. 

Four variants from library A did not contain amino 
acid substitutions at positions randomized in 
library A. Instead, these variants contained amino 

30 acid substitutions at positions randomized in 

* library. B, and thus these variants are listed as 
variants B1-B4 in Taole 7. Similarly, four variants 
from library B did not contain amino acid substitutions 
at positions randomized in library B, but rather 

35 contained substitutions at positions randomized in 
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library A. These variants are listed as variants A5-A8 
in Table 6 . 

5 Library B - HRG-01 183-188 (HRG-&1 EGF 7-12) 

HRG-01 183-188 (HRG-01 EGF 7-12) has a helical 
character in the wild- type protein. The amino acid 
changes in the variants selected from this library are 
shown in Table 7. 

10 

Table 7 
Library B Variants 



Position in HRG-/31 



15 


Variant No. 


183 184 


185 


186 


187 


188 












Wild- type 


A 


E 


K 


E 


K 


T 


(SEQ 


ID 


NO: 


49) 




1* 


G 


V 


G 


R 


D 


G 


(SEQ 


ID 


NO: 


50) 




T 


' G 


G 


E 


R 


E 


G 


(SEQ 


ID 


NO: 


51) 


20 


3 


G 




E 


R 


E 


G 


(SEQ 


ID 


NO: 


52) 




4\ 5* 


G 


W 


D 


R 


E 


G 


(SEQ 


ID 


NO: 


53) 




6* 


G 


V 


Q 


R 


E 


G 


(SEQ 


ID 


NO: 


54) 




7 


G 




E 


R 


A 


G 


(SEQ 


ID 


NO: 


55) 




8 


G 


K 


E 


R 


E 


G 


(SEQ 


ID 


NO: 


56) 


25 


9" 


T 


N 


S 


R 


E 


G 


(SEQ 


ID 


NO: 


57) 




10* 


D 


K 


S 


R 


E 


G 


(SEQ 


ID 


NO: 


58) 




11* 


G 




D 


R 




Q 


(SEQ 


ID 


NO: 


59) 




12 


G 


R 


E 


R 


E 


G 


(SEQ 


ID 


NO: 


60) 


30 


"Variant also 


contained 


Met226Ile. 















Randomization of this region produced the most dramatic 
, changes from the wild- type sequence, although the 
generally hydrophilic character of this region was 
35 maintained in the variants sequenced. In particular, 
this six-residue stretch sorted to Gly residues at the 
first and last positions, Alal83 and Thrl88 (Ala7 and 
Thrl2) . There was also a change in registry of 
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positive and negative charges at Lysl85 (Lys9) , which 
sorted to Glu and Asp, among others; Glul86 (GlulO), 
which sorted exclusively to Arg; and Lysl87 (Lysll) , 
which sorted to Glu and Asp. Gj.u184 (Glu8) sorted to a 
5 variety of different types of residues, indicating that 
this side chain does not play an important role in 
ErbB-3 receptor binding. 

Library C - HRG-01 191-195 (HRG-&1 EGF 15-19) 
10 HRG-)8l 191-195 (HRG-/S1 EGF 15-19) includes the 

/3-turn between the helix and the second jS-strand. The 
amino acid changes in the variants selected from this 
library are shown in Table 8 . 

15 Table 8 

Library C Variants 

Position in HRG-/31 
Variant No. ' 191 192 193 194 195 



20 Wild-type V N G G E (SEQ ID NO: 61) 

1, 1, 4, 5, 7-12 (SEQ ID NO:62) 

3 .... V (SEQ ID NO: 63) 

6 ■ Q (SEQ ID NO: 64) 

25 

Randomization and selection produced variants in which 
the wild-type amino acid sequence in this region was 
almost completely conserved. Mutations were found in 
only two variants, both at Glul95 (Glul9) . This result 
30 is consistent with an important role for the wild-type 
residues in this region in ErbB-3 receptor binding. 
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Library D - HRG-/31 197-201 (HRG-fBl EGF 21-25) 

HRG-01 197-20^ (HRG-01 EGF 21-25) is present in 
the second /?-strand in the N-terminal subdomain. The 
amino acid changes in the variants selected from this 
5 library are shown in Table 9. 



Table 9 
Library D Variants 



Variant No . 


197 


198 


Position in 
199 200 201 


HRG- 


01 






Wild- type 


F 


M 


V K 


D 


(SEQ 


ID 


NO: 


65) 


1\ 2\ 8*, 12* 


Y 


K 


R 


I 


(SEQ 


ID 


NO: 


66) 


3 




R 




T 


(SEQ 


ID 


NO: 


67) 


4, 5, 7, 9 


Y 


R 




T 


(SEQ 


ID 


NO: 


68) 


6 


Y 




I 


Y 


(SEQ 


ID 


NO: 


69) 


10 


Y 






T 


(SEQ 


ID 


NO: 


70) 


11 


M 


R 


R 


T 


(SEQ 


ID 


NO: 


71) 



20 * . 

"Variant also contained Met226Ile. 

Randomization in this region yielded either a gain or 
loss of charge for two of the five residues. Phel97 

25 (Phe21) sorted to Tyr in 10 out of 12 variants, 

maintaining aromaticity at this position. Metl98 
(Met22) sorted to a positively charged residue in 11 
out of 12 variants. Vall99 (Val23) was conserved in 
all variants, and Lys200 (Lys24) sorted either to 

30 wild-type or to Arg, retaining the positive charge at 
this position. Asp201 (Asp25) sorted to uncharged 
residues Thr or lie, retaining the /3-branch character 
of this position. Variants Dl , D2 , D8 , D12 also 
' included the spontaneous affinity- enhancing Met226Ile 

35 (Met50Ile) mutation. 
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Library E - HRG-pl 205-209 (HRG-/31 EGF 29-33) 

HRG-/S1 205-209 (HRG-/S1 EGF 29-33) includes 
residues present in the third 0- strand in the 
N- terminal subdomain. The amino acid changes in the 
5 variants selected from this library are shown in 
Table 10. 



Table 10 
Library E Variants 

10 





Variant No. 


205 


206 


Position 
207 208 


in 
209 


HRG- 


01 








Wild- type 


P 


S 


R 


Y 


L 


(SEQ 


ID 


NO: 


72) 


15 


1 




T 


P 


Y 


L 


M 


(SEQ 


ID 


NO: 


73) 




2, 


4 


Y 


G 


Y 


L 


M 


(SEQ 


ID 


NO: 


74) 




3* 




Y 


R 


Y 


R 


M 


(SEQ 


ID 


NO: 


75) 




5, 


12 


T 


H 


Y 


R 


G 


(SEQ 


ID 


NO: 


76) 




6 




T 


H 


Y 


R 


M 


(SEQ 


ID 


NO: 


77) 


20 


T 




Y 


K 


Y 


R 


M 


( SEQ 


ID 


NO: 


78) 




8, 


9 


T 


K 


Y 


R 


G 


(SEQ 


ID 


NO: 


79) 




10 




Y 


K 


Y 


R 




(SEQ 


ID 


NO: 


80) 



11* 

25 

'Variant also contained Met226Ile. 

"Variant Ell was a contaminant from library F (identical 
to the other 12 library F variants) . 

30 

Upon randomization, Pro205 (Pro29) sorted to Thr or 
Tyr. Ser206 (Ser30) sorted to mixed residues, 
predominantly those having basic side chains, although 
Gly also appears twice (in sequences derived from the 

35 * same variant) . An inversion of side chains occurred 
for Arg207 (Arg31) and Tyr208 (Tyr32) , the first of 
which sorted exclusively to Tyr and the second of which 
sorted primarily to Arg (seven variants) and Leu (four 
variants) . This finding was particularly unexpected, 

40 given that Tyr208 stacks with Phel89 (Phel3) in the 
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structure and is conserved in the EGF sequence. See 
Jacobsen et al . , Biochemistry 35:3402-17 (1996). At 
Leu209 (Leu33) , the relatively conservative Met 
substitution was found in the majority of variants, but 
5 Gly was also found. 



Library G - HRG-(3l 211-216 (HRG-/31 EGF 35-40) 

HRG-01 211-216 (HRG-/31 EGF 35-40) includes the 
first /5-strand of the C-terminal subdomain of HRG-01 
10 EGF. The amino acid changes in the variants selected 
from this library are shown in Table 11. 

Table 11 
Library G Variants 

15 

Position in HRG-/31 





Variant No . 


211 


212 


213 


214 


215 


216 












Wild- type 


K 


C 


P 


N 


E 


F 


(SEQ^ 


ID 


NO: 


81) 


20 


1, 


5, 6, 10, 12 


R * 




S 


L 






(SEQ 


ID 


NO: 


82) 




2 




R 




s 


E 






(SEQ 


ID 


NO: 


83) 




3 










K 




M 


(SEQ 


ID 


NO: 


84) 




4 




R 




T 


V 




Y 


(SEQ 


ID 


NO: 


85) 




7 , 


8 


R 




T 


V 




Y 


(SEQ 


ID 


NO: 


86) 


25 


9 








N 


S 






(SEQ 


ID 


NO: 


87) 




11 




R 




K 


K 






(SEQ 


ID 


NO: 


88) 



Upon randomization, 10 out of 12 variants contained a 

Lys211Arg (Lys35Arg) mutation, thus retaining a 
30 positive charge at this position, which lies between 

two cysteines. Pro213 (Pro37) sorted to mixed 

hydrophilic residues, and Asn214 (Asn38) sorted to a 
'mixture- of residues, with Leu and Val appearing most 

frequently. Glu215 (Glu39) was conserved in all 
35 variants, and Phe216 (Phe40) was retained in eigl.t out 

of 12 variants with a conservative Tyr substitution in 

three of the remaining variants. 
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Library H - HRG-01 217-220 (HRG-/31 41-44) 

The HRG-01 217-220 (HRG-01 EGF 41-44) library 
proved vulnerable to contamination by a high-affinity 
variant from library B (variant B5) . This variant was 
5 found in 11 out of 12 variants. In the single 

unaffected variant, the wild-type amino acid sequence 
was conserved except for an Asp219Glu (Asp43Glu) 
mutation. This result suggests that this region 
requires the wild-type or similar sequences for optimal 
10 binding. 

Library I HRG-fSl 222-226 (HRG-/31 EGF 46-50) 

HRG-/31 222-226 (HRG-/31 EGF 46-50) includes a short 
strand of /3-sheet that aligns with the strand of 
15 /3-sheet at HRG-/31 213-216 (HRG-01 EGF 38-40) . The 

amino acid changes in the variants selected from this 
library are shown in Table 12. 

* . Table 12 

2 0 Library I Variants 

Position in HRG-01 

Variant No. 222 223 224 225 226 

Wild-type Q N Y V M (SEQ ID NO:89) 

1, 4, 8 W . . I (SEQ ID NO:90) 

2, 3, 5-7, 9-12 H . . I (SEQ ID NO:91) 

Upon randomization of HRG-01 222-226, only two types of 

3 0 variants were found in the 12 sequenced, both having 

wild-type residues conserved at Gln222 (Gln46) , Tyr224 
(Tyr4 8) , and Val225 (Val4 9) , and both having a 
, Met226Ile (Met50Ile) mutation. Asn223 (Asn47) sorted 
to His (nine variants) or ^rp (three variants) . The 
35 strong affinity-enhancing effect of Met226Ile is 

evidenced by its presence in all variants sequenced 
from this library and a high frequency of occurrence in 
variants from several other libraries. 
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Library F - HRG-/31 178, ISO, 198, and 200 (HRG-&1 EGF 
2, 4, 22 and 24) 

When Hisl78 (His2), Vall80 (Val4), Metl98 <Met22), 
and Lys200 (Lys24) were simultaneously randomized, onl> 
5 one type of variant was found. Hisl78, Vall80, and 

Lys200 sorted to wild-type residues, and Metl98 sorted 
to Lys . These variants additionally contained the 
spontaneous Met226Ile (Met50Ile) mutation, which gave 
the variants a significant selective advantage over 
10 other sequences. It is striking that the wild- type His 
was found at position 178 because none of the 12 
variants sequenced from library A contained Hisl78 . 

Conservation of HRG-/31 EGF Residues in Phage Display 

15 and Alanine Scanning Results 

Positions where alanine substitution strongly 
affected binding affinity tended to sort to the wild- 
type residue. Sea, for example, the data for positions 
at the junction between the N- and C-terminal 

20 subdomains, (i.e., the /3-turn at HRG-/31 1-91-195 [HRG-01 
EGF 15-19] and the loop at HRG-£l 217-220 [HRG-01 EGF 
41-44] ) Additionally, positions where alanine 
substitution produced less significant effects tended 
to undergo substantial mutation upon phage display, as 

25 seen for the helical stretch at HRG-/81 183-188 (HRG-/S1 
EGF 7-12) . 

Analysis of the Impact of Selected Mutations on 

Receptor Affinity and Specificity 
30 Individual variants from each library were chosen 

for phage production and further characterization of 
, the mutated HRG-01 EGFs ("variants") displayed on the 

phage. The choice of variants for further 

characterization was based on selection frequency, with 
3 5 a bias towards sequences not containing the 

advantageous Met226Ile (Met50Ile) substitution. In 

addition, phagemid vectors for use in producing phage 
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displaying mutated HRG-01 EGFs containing various 
combinations of the above mutations ("combination 
variants") were prepared by Kunkel mutagenesis, using 
templates prepared from several of the above -described 
5 variants. The amino acid substitutions in the 

combination variants are indicated in Figure 4. The 
affinities of the variants and combination variants for 
binding to ErbB-3-Ig (relative to the affinity of wild- 
type HRG-/31 EGF) were determined by phage ELISA, as 
10 described in Example 1 . The results are shown in 
Table 13 . Some of the variants and combination 
variants were also tested for relative ErbB-4-Ig 
affinity to assess specificity. 
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Table 13 

Affinities of Heregulin-01 EGF Variants and 
Combination Variants for ErbB-3-Ig and ErbB-4-Ig 
as Determined by Phage ELISA 



Phage ErbB-3 Phage ErbB-4 





Construct 


EC S0 (wt) /EC 50 (mut) * 


EC 50 


(wt)/EC 50 




HRG8 


1 


1 






HRG6 3 


1.1 ± 0.8 






10 


Al 


0.55 ± 0.27 








A2 


0.87 ± 0.5 








A3 


0.96 ± 0.43 


1 . 1 


± 0.4 




A4 


<0 . 3 








B3 


4.7 ± 0.77 


1 . 7 


+ 0.5 


15 


B5 


26 ± 20 








BIO 


9 . 3 








Dl 


11 ± 0.45 








D4 


2.9 ± 1.1 


5 . 0 


±3.7 




D10 


2 . 1 






20 


E2 


28 ± 13 


7 . 6 






E3 


16 








E6 


6.6 ± 3.0 ' 








E8 


17 ± 2.3 








Gl 


1.2 ± 0.36 


1 . 6 




25 


G4 


0.86 ± 0.19 








H5 


1 . 5 








11 


13 ± 11 








12 


20 ± 20 


10 






Fl 


4.4 ± 3.3 






30 


HRG9 0 


6.3 ± 3.1 








4 HRG3 7 ' 


13 + r 








HRG3 8 


<0 . 3 








HRG4 0 


<0 . 3 








HRG41 


<0 . 3 






35 


HRG4 8 


49 ± 15 







-84- 



Li! o n h K J - --«■ .... o f a j. o o f* 



405519 . ' 

Phage ErbB-3 Phage ErbB-4 

Construct EC S0 (wt ) /EC S0 (mut)' EC S0 (wt) /EC S0 (mut) 



HRG5 3 


26 


± 


16 


HRG54 


12 


± 


9 . 2 


HRG5 5 


13 


± 


11 


HRG5 6 


31 


± 


22 


HRG57 


24 


± 


16 


HRG5 8 


58 


± 


11 


HRG5 9 


26 


± 


14 


HRG6 0 


63 


± 


11 


HRG61 


29 


± 


25 


HRG6 2 


32 


± 


14 


HRG71 


79 


± 


56 


HRG73 


56 


± 


6 . 6 



'Based on the wild-type HRG-/31 EGF-phage EC 50 
15 (=135 ± 104 nM for ErbB-3~Ig, 163 ± 112 nM for 

ErbB-4 -Ig) determined on during the same assay run. 
Shown are standard deviations for averages of 2-4 
duplicate runs, or the average only for assays 
performed in duplicate for one run. 

20 

Variants from library A had EC 50 values very 
similar to wild-type HRG-01 EGF . Variants from 
libraries B and D had significantly enhanced affinity 
for ErbB-3-Ig and ErbB-4-Ig, in the range of three to 

25 five times the wild-type affinity for selected variants 
not containing Met226Ile (MetSOIle) . (See variants B3 
and D4 . ) Substantially greater enhancements in 
affinity (up to 26-fold wild-type HRG-01 EGF) were 
measured for variants that contained Met226Ile 

30 (MetSOIle) . (See variants B5 , BIO, and Dl). 

, Variants from library E showed even greater 

affinity enhancements. For example, variant E2 , which 
differed from the library consensus sequence (Figure 3) 
at HRG-/31 206 and 208 (HRG-01 EGF 30 and 32), had an 

35 affinity enhancement of 28-fold. The effects appear to 
be associated with the amino acid substitutions at 
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HRG-/31 205-209 (HRG-01 EGF 29-33) rather than the 
adjacent three - residue deletion, since the affinity for 
ErbB-3-Ig of a A202-204 (A26-28) control variant 
(HRG63) was similar to that of wild-^ype HRG-/31 E3F 
5 (Table 13) . 

The single variant from library F, having 
Metl98L,ys (Met22Lys) and Met226Ile (MetSOIle) 
substitutions, showed an enhancement only slightly 
above that for Met226Ile alone, indicating a small 

10 effect attributable to the Metl98Lys mutation. 
Variants from library G showed little, if any, 
ErbB-3-Ig affinity enhancement. The single Asp219Glu 
(Asp43Glu) mutation from the lone library H variant 
provided a modest increase in affinity. The two 

15 library I variants had significant enhancements in 

affinity for ErbB-3-Ig and ErbB-4-Ig. The enhancements 
were attributable to an approximately six-fold effect 
from the Met226Ile (Met50Ile) mutation (see variant 
HRG90) , and an additional enhancement of about two- to 

20 three-fold from Asn223 (Asn47) to Trp or 'His mutations. 
Phage displaying combination variants containing 
variant A3 mutations performed poorly. The A3 + B3 
combination had an EC 50 for ErbB-3-Ig similar to that of 
variant B3 , but the other combinations tested did not 

25 bind with detectable affinity. This could be due to a 
disadvantageous interaction of the SerlWTrp, 
Hisl78Glu, or Lysl81Pro (SerlTrp, His2Glu, or LysSPro) 
mutations with mutations from the other libraries. The 
combination of the B3 and E2 mutations yielded slightly 

30 diminished affinities relative to that of variant E2 . 

The combination of mutations from variants from * 
, the other libraries gave closer to additive behavior 
and resulted in the^ enhancement of EC 50 values by 
greater than 50-fold. The EC 50 values for the best 

35 combination variants were close to the lower limit of 
the assay at the receptor coating concentration used 
(approximately 4 nM) . Using lower levels of receptor 
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Croat yielded sub-nanomolar EC 50 values for these 
combination variants, but did not allow measurement of 
wild-type affinity, due to minimal binding of wild-type 
HRG-/31 EGF phage. The best combination variants 
5 contained the D4 sequence, either the B3 or E2 

sequence, and either the single MetSOIle mutation or 
the 12 sequence. 

In general, variants that exhibited enhancements 
in affinity for ErbB-3-Ig also exhibited similar 
10 enhancements in affinity for ErbB-4-Ig. 

Expression, Purification, and Assay of Soluble 
HRG-fil EGF Variants 

A number of variants were expressed in soluble 

15 form (i.e., mutated HRG-01 EGFs were expressed free of 
M13 pill) . To facilitate periplasmic expression of 
soluble variants, TAG codons were installed following 
HRG-01 226 (HRG-03, EGF 52) in the corresponding 
phagemids by Kunkel mutagenesis, and the resulting 

20 constructs were transformed into 34B8 cells (genotype: 
tonAA phoAAE15 A (argF-lac) 169 deoC2 ompTA degP41 
(APstl-kanR) ) . The constructs included those for 
expressing variants A3, E2 , and Fl , and combination 
variants HRG58 (D4+ E2+M50I ) and HRG73 (B3 +D4 +E2+I2 ) , as 

25 well as the construct that expressed wild-type HRG-01 
EGF (HRG8) . 

Expression at levels of about 1 mg/L was achieved 
as follows. Cell cultures were grown to a density of 
OD 550 = 1.0 in LB medium containing 50 pig/mL 

30 carbenicillin and used to inoculate modified AP5 medium 
at a 1/100 dilution. (Modified AP5 medium contained: ■ 
,1.5 g/L glucose, 11 g/L Hycase SF, 0.6 g/L yeast 
e^cract, 0.19 g/L anhydrous MgS0 4 or 
0.394 g/L MgS0 4 *7H 2 0, 1.07 g/L ammonium chloride, 

35 3.73 g/L KC1 , 1.2 g/L NaCl , 120 mL/L 1 M triethanol- 
amine [pH 7.4] . ) 
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Wild-type HRG-/?1 EGF and variants thereof were 
purified to homogeneity from periplasmic shockates in a 
single reverse -phase HPLC step. Briefly, cells were 
harvested af^er 24 h of growth at 30 *C (OD 550 = 1.2 ) by 
centrifugation at 4500 rpm, and the pellets were frozen 
in ethanol/dry ice for 2 h. Following resuspension and 
thawing in 5 mM MgCl 2 , 75 mM CaCl 2 , 1 mM phenyl methyl 
sulfonyl fluoride, and 10 mM Tris-HCl (pH 7.6), shocked 
cells were removed by centrifugation, leaving 
shockates. Shockates were filtered and chromatographed 
by semipreparative C18 reverse -phase HPLC using a 
gradient from 0-40% acetonitrile over 80 min, with a 
flow rate of 3 mL/min . Fractions shown by electrospray 
mass spectrometry to contain HRG-/31 EGF (or. variants 
thereof) were lyophilized and resuspended in 1 mM EDTA, 
10 mM Tris-HCl (pH 7.6) . Proteins were found to be 
essentially homogeneous as determined by amino acid 
analysis and SDS-^AGE. 

The affinities of* the soluble variants for ErbB-Ig 
were determined by measuring inhibition of 125 I-HRG 
binding to ErbB-2/3, -3 and -4 receptor-Ig fusions, as 
previously described in Example 1. Although the 
ErbB-2/3-Ig preparation also contains ErbB-2/2 and 3/3 
homodimers (as a result of coexpression of ErbB-2- and 
-3-Ig's), the displacement of 125 I-HRG should be 
predominantly from ErbB-2/3 -Ig because of the 
approximately 100-fold higher affinity of wild-type 
HRG-/31 EGF for the 2/3 -heterodimer versus the 3/3 
homodimer . 

The results are shown in Table 14 . 
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Table 14 

Binding and Activation Parameters for Soluble 
Heregulin-01 EGF Variants 

5 



Construct 


ErbB-3-Ig 
EC 50 (nM) 




ErbB-4-Ig 
EC 50 (nM) 




ErbB-2/3 
EC 50 (nM) 


-Ig 


KIRA 
EC 50 (: 


HRG8 (WT) 


2 . 3 


± 


0 . 


2 


1 . 5 ± 0 


. 1 


0 . 
0 . 


. 033 
003 


± 




0. 


212 


A3 


4 . 3 


± 


0 . 


2 


2.1 + 0 


. 1 


0 . 
0 . 


067 
007 


± 




0 . 


182 


E2 


2 . 0 


± 


0 . 


2 


2.4 ± 0. 


02 


0 . 
0 . 


035 
007 


± 




0 . 


092 


Fl 


0 .60 


± 


0. 


04 


0.26 ± 0. 


Oil 


0 . 
0 . 


061 
002 


± 




0 . 


151 


HRG5 8 


0 . 80 


± 


0 . 


01 


0.30 ± 0 . 


005 


0 . 
0 . 


075 
006 


± 




0 . 


129 


HRG7 3 


2 . 7 


+ 


0 . 


1 


0.31 ± 0 . 


003 


0 . 


20 


± o 


. 006 


0 . 


173 



The variants showed higher affinity for ErbB-3-Ig than 
15 did wild-type HRG-/31 EGF (up to 4-fold), although the 
affinity enhancements were less dramatic that 
determined by phage EL ISA . Analysis in this format 
also revealed that the E2 mutations confer greatly 
enhanced affinity, and that the additional D4 and 
20 Ket50Ile mutations (in construct HRG58) do not 

contribute additively. As observed in the phage 
ELISAs, the variants showed enhancements in affinity 
for ErbB-4-Ig that were similar to affinity- 
enhancements for ErbB-3-Ig. Variant affinities for 
25 ErbB-2/3-Ig were similar to the affinity of wild-type 
HRG-01 EGF for the 2/3-heterodimer, although the 
maximally substituted construct (HRG72) bound 6-fold 
more weakly to ErbB-2/3 - Ig than did wild-type HRG-/31 
EGF. 

30 The soluble variants were also assayed for their 

ability to stimulate tyrosine phosphorylation of the 
ErbB-2 receptor on MCF7 breast cancer carcinoma cells. 
This was accomplished in a KIRA-ELISA format as 
described in Sadick et al . , Analyt. Biochem. 235:207- 

35 214, in which the ErbB-2 phosphorylation detected is 
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believed to be due primarily to the formation of 
ErbB-2/3 heterodimers . 

Briefly, 2 x 10 5 MCF-7 cells were added to each 
well of a flat-bottom 96-well culture plate and 
5 cultured overnight. The following morning the culture 
medium was replaced with medium containing a variant or 
HRG8 (wild-type) at concentrations ranging from 0 to 
10 nM. The cells were stimulated at 37°C for 30 min, 
the culture medium was decanted. To lyse the cells and 

10 solubilize the receptors, 100 fil of lysis buffer was 
added to each well. Lysis buffer consisted of 150 mM 
NaCl, 50 mM HEPES , 0.5% Triton-XlOO, 0.01% thimerosol , 
30 klU/ml aprotinin (ICN Biochemicals , Aurora, OH) , 
1 mM 4- (2-aminoethyl) benzenesulf onyl fluoride 

15 hydrochloride (AEBSF; ICN Biochemicals) , and 2 mM 
sodium orthovanadate . The plate was then agitated 
gently on a plate shaker (Bellco Instruments, Vineland, 
NJ) for 60 min at room temperature. 

While the cells 'were being solubilized, an ELISA 

20 microtiter plate (Nunc Maxisorp™, Inter Med, Denmark) 
that had been coated overnight at 4°C with the 
affinity-purified polyclonal anti-ErbB-2 ECD antibody 
(1.0 ^ig/ml in 50 mM carbonate buffer (pH 9.6) was 
decanted, and blocked with block buffer (PBS, 0.5% BSA 

25 [Intergen Co., Purchase, NY], 0.01% thimerosol) for 60 
min at room temperature with gentle agitation. After 
60 min, the anti-ErbB-2 ECD-antibody coated plate was 
washed six times with wash buffer (PBS, 0.05% Tween 
20™, 0.01% thimerosol) using an automated plate washer 

30 (ScanWasher 300, Skatron Instruments, Inc., Sterling, 
VA) . 

, The lysate containing solubilized ErbB-2 from each 

cell-culture microtiter well was transferred to each 
anti-ErbB2 ECD antibody-coated and blocked ELISA well 
35 and incubated for 2 h at room temperature with gentle 

agitation. The unbound receptor was removed by washing 
with wash buffer, and 100 /xl of biotinylated 4G10 
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( ant iphosphotyros ine antibody) diluted to 0.2 /xg/™l in 
dilution buffer (PBS, 0.5% BSA, 0.05% Tween 20™, 
5 mM EDTA, 0.01% thimerosol) was added to each well. 
After incubation for 2 h at room temperature, the plate 
5 was washed, and 100 fxl horseradish peroxidase- 

conjugated streptavidin (Zymed Laboratories, S. San 
Francisco, CA) , diluted 1:50000 in dilution buffer, was 
added to each well. The plate was incubated for 30 min 
at room temperature with gentle agitation. The free 

10 avidin conjugate was washed away, and 100 fil freshly 
prepared substrate solution (tetramethyl benzidine, 
TMB, two-component substrate kit, Kirkegard and Perry, 
Gaithersburg, MD) was added to each well. The reaction 
was allowed to proceed for 10 min, after which the 

15 color development was stopped by the addition of 1.0 M 
H 3 P0 4 . 

The results are shown in Table 14. The EC 50 is the 
concentration of variant (or HRG8) required to achieve 
50% maximal tyrosine* phosphorylation . In general, the 
20 EC 50 's for stimulation of phosphorylation .by the 

variants tested did not differ substantially from the 
EC 50 for wild- type HRG-01 EGF . In addition, the values 
correlated well with the IC B0 's for binding to the 
receptor fusions above. 

25 

EXAMPLE 4 

Selection of Heregul in- 61 EGF Domain Variants 
For ErbB-4 Receptor Binding 
Using Monovalent Phage Display 
30 This example describes the selection of HRG-01 

variants for binding to the ErbB-4 receptor, and in 
4 particular, variants having a greater specificity for 
the ErbB-4 receptor, relative to the ErbB-3 receptor, 
than wild- type HRG-/31. Such variants have, for 
35 example, a lower variant : wild- type EC 50 ratio for 

binding to ErbB-4-Ig than for binding to ErbB-3-Ig. 
The variants studied contained residues corresponding 
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to HRG-/31 177-244, which includes the minimal EGF-like 
domain (HRG-/31 177-228) . In this example, "HRG-/31 EGF" 
refers to the region of the EGF-like domain extending 
from residues 177-244. Residue numbers are expressed 
5 in terms of position in 645-amino acid native human 
HRG-/B1 and, in parentheses, in terms of position in 
HRG-01 EGF (i.e., HRG-01 EGF 1-68). 

The variants were produced by randomization at 
Hisl78, Leul79, and Arg207 (His2, Leu3 , and Arg31) . 

10 These residues were chosen because alanine scanning 

(Example 2) indicated that the substitution of alanine 
at these positions resulted in a significantly greater 
loss of affinity for the ErbB-3 receptor, as compared 
to the ErbB-4 receptor, suggesting that these residues 

15 may be more important for ErbB-3 receptor binding than 
for ErbB-4 receptor binding. Additionally, these 
residues are predicted to be proximal to one another on 
the surface of the, HRG-/31 molecule, potentially forming 
a binding site. ' - 

20 

Phage Library Screening for ErbB-4 -Ig Binding 

Variants of HRG-/31 EGF were prepared and selected 
for binding to ErbB-4 -Ig using monovalent phage 
display, according to the method of Bass et al . , 

25 Proteins 8:309-14 (1990). Briefly, the HRG-/S1 EGF 

phagemid vector was pHRG2-p3 (described in Example 1) , 
in which HRG-/31 177-244 was fused to a C-terminal 
fragment of the M13 coat protein pill. Kunkel 
mutagenesis was performed to introduce stop codons into 

30 this vector at Hisl78, Leul79, and Arg207 (His2, Leu3 , 
and Arg31) to ensure that the starting vector could 
4 not express the wild-type polypeptide. 

These sites were then randomized by Kunkel 
mutagenesis to produce an HRG-/31 EGF library. Phage 

35 displaying mutated HRG-01 EGFs monovalently were 

produced from the library. See Bass et al . , supra. 
These phage were then sorted against ErbB-4 -Ig 
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homodimers immobilized on an ELISA plate. Bound phage 
were eluted and used to reinfect host cells, which were 
used to produce new phage. Half of the phage from the 
first round of sorting was sorted against immobilized 
5 ErbB-4-Ig for three additional rounds of sorting. The 
other half of the phage from the first round was sorted 
against immobilized ErbB-4-Ig for three additional 
rounds in the presence of soluble ErbB-3-Ig (10 nM) . 
Sorting the presence of soluble ErbB-3-Ig (i.e., 

10 "counter-selecting" against ErbB-3-Ig) was expected to 
remove variants with higher affinity for ErbB-3-Ig, 
allowing enrichment of those with higher affinity for 
ErbB-4-Ig in each round of sorting. After sorting, 
twelve clones from each of the two resultant libraries 

15 were sequenced. 

Construction of Initial Phage Library 

The initial phage library was constructed by 
Kunkel mutagenesis using uracil - containing single- 

20 stranded DNA template. TAA and TGA stop "codons were 
installed at positions selected for randomization to 
generate a custom template that eliminated wild-type 
background from the pools. Positions were fully 
randomized by mutation to NNS codons (where N is any of 

25 the four bases and S is either G or C) . Two 

mutagenesis oligonucleotides were used, one to 
randomize Hisl78 and Leul79 (His2 and Leu3) and one to 
randomize Arg207 (Arg31) . The oligonucleotides 
contained 15-base overhangs on either side of the 

30 randomized residues. 

The final mutagenesis reaction mixture was 
4 electro.- transformed into XL-1 blue cells (Stratagene, 
Inc., La Jolla, CA) , according to the manufacturer's 
protocol. The transformed cells were then infected 

35 with 10 11 pfu M13K07 helper phage (Promega Corp., 

Madison, WI) , and phage stocks (about 10 14 phagemid/mL) 
were prepared as described in Example 1. 
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At least 10 8 transf ormants were obtained, 
indicating that the library had excellent 
representation of the possible amino acid sequence 
combinations . 

5 

Selection of Phage for ErbB~4~Ig Binding 

Monovalent phage were prepared and the selection 
performed on ErbB-4-Ig prebound to microtiter plates 
via capture with polyclonal antibodies to the human Fc 
10 fragment, as described in Example 1. Approximately 10 12 
phage in 100 binding buffer (PBS, 0.1% BSA, 0.05% 
Tween 20™) were applied to an ErbB-4-Ig-coated well and 
a control well to which no ErbB-4-Ig had been added. 
After the first round of sorting, half of the resultant 
15 phage (hereafter the " counter- selected library") were 
sorted in the presence of 10 mM ErbB-3-Ig in the 
binding buffer. Following a 2 h incubation at room 
temperature, the plates were washed extensively (12x) 
and phage eluted by adding 100 (jlIj of a solution of 
20 50 mM HC1 and 0.05% Tween™ 20 and shaking for 10 min. 

Eluates were neutralized with 10 jxL 1 M Tris-HCl 
(pH 8.0) and 20 jzL used for titration on log-phase XL-1 
blue cells. The remainder was used to infect 1 mL of 
log-phase XL-1 blue cells (30 min at 37'C), which were 
25 then superinf ected with 2 x 10 10 pfu M13K07 phage and 

grown in 25 mL 2YT broth (16 g/L tryptone, 10 g/L yeast 
extract, 5 g/L NaCl) containing 50 M9/™ L carbenicillin 
for 18-24 h. Phage were harvested as described above 
and the cycle repeated. 
30 After round four of selection, twelve clones from 

each library were randomly picked and sequenced by the- 
^dideoxy method. See Sanger et al . , PNAS USA 74:5463-67 
(1977) . The amino acids at the randomized positions 
deduced from the DNA sequences are shown in Table 15 (a 
35 " . " indicates a residues that is identical to the wild- 
type residue) . 
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10 



15 



35 



Table 15 

Mutations and Affinities for ErbB-3- and ErbB-4-Igs 

in Phage ELISA 

Four Rounds of Sorting for ErbB-4-Ig Binding 
(No Counter-Selection) : 

Wild-type H L R ^Siblings *- ErbB-4 ErbB-3 

EC c 0 Mut/WT* EC C . Mut/WT 

Clones M 7 1.5 1.2 

W 2* 1.7 1.2 

E 2 1.7 9.5 

V 1 1.6 1.5 



Four Rounds of Sorting for ErbB-4 -Ig Binding 
With ErbB-3-Ig Counter- Select ion : 

20 Clones P 7 1.3 6.1 

L 1* 1.7 6.4 

L 1 1.2 1.4 

D V 1 2.0 2.2 

E V 1 1.7 0.8 

25 TM. 1 1.6 1.4* 

'HRG-01 228-331 (HRG-£l EGF 52-55) were spontaneously 
deleted and replaced by a single Met residue. 

30 'Number of variants having this sequence (out of the 
twelve sequenced for each library) . 



*EC 50 for the variant divided by the EC 50 for wild-type 
HRG-/31 177-244 expressed monovalent ly on phage 



One variant predominated in each library, being 
represented in seven of the twelve clones sequenced 
from each library. The predominant variant from the 
40 library counter - selected against ErbB-3-Ig sorted back 
to the wild-type residue at positions Hisl78 and Leul79 
(His2 and Leu3) and had Pro at Arg207 (Arg31) . The 
^predominant variant from the library that was sorted 
without counter-selection sorted to wild-type at 
45 positions Hisl78 and Arg207 (His2 and Arg31) and had 
Met at Leul79 (Leu3) . In variants from each of the 
libraries, there was a spontaneous substitution of Met 
for HRG-/31 228-231 (HRG-£l EGF 52-55) . 



-95- 



405519 

Analysis of the Impact of Mutations on Receptor 
Affinity and Specificity 

Receptor binding affinities were measured by phage 
EL ISA for all unique variants from each library, as 
5 described in Example 1. The results are shown in 

Table 15. The affinities of the variants for ErbB-3-Ig 
and ErbB-4-Ig were determined to assess specificity. 

The affinities of most variants for the ErbB 
receptor-Ig fusions were reduced between about 1.2-fold 

10 and 2.2-fold. Only one variant showed a slight 
enhancement in affinity, and only for ErbB-3-Ig. 
However, three variants had affinities for ErbB-4-Ig 
that were less than two-fold below the affinity of 
wild- type HRG-/31, whereas their affinities for 

15 ErbB-3-Ig were 6- to 9-fold below the affinity of wild- 
type HRG-jSl. All of these variants sorted back to the 
wild- type residue at Leul79 (Leu3) . One variant sorted 
to Glu at Hisl78 (His2) and to wild-type at Arg207 
(Arg31) . Another variant sorted to wild-type at Hisl78 

20 (His2) and to Pro at Arg 207 (Arg31) . This was the 
predominant variant in the counter- selected library. 

The third variant sorted to Leu at Hisl78 (His2) 
and to wild-type at Arg207 (Arg31) and also had the 
spontaneous substitution of Met for HRG-/31 228-231 

25 (HRG-jSl EGF 52-55) . The affinity of this variant for 

ErbB-3-Ig was reduced 6.4 fold, whereas another variant 
with the same sequence at Hisl78, Leul79, and Arg207 
(His2, Leu3, and Arg31), but lacking the spontaneous 
substitution, retained approximately wild-type affinity 

30 for 

ErbB-3-Ig. Thus, the reduction in affinity for 
, ErbB-3-Ig binding is largely attributable to this 
spontaneous substitution. Because HRG-/31 228-231 
(HRG-/31 EGF 52-55) is unlikely to be proximal tc the 
3 5 randomized region on the surface of the HRG-01 

molecule, the data suggest that at least two distinct 
regions of HRG-01 are important for ErbB receptor 
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binding, namely residues Hisl78, Leul79, and Arg207 
(His2 # Leu3, and A~g31) and HRG-/31 228-231 (HRG-/31 EGF 
52-55) . 

Two of the three variants exhibiting enhanced 
5 specificity for ErbB-4-Ig binding accounted for eight 

of the twelve clones isolated from the counter- selected 
library. The other variant accounted for only two of 
the twelve clones isolated from the library that was 
sorted without counter- select ion . Thus, counter- 
10 selection against ErbB-3-Ig produced a significant 

enrichment in variants exhibiting greater specificity 
for ErbB-4-Ig, relative to ErbB-3-Ig, than that of 
wild-type HRG-01. 



-97- 



iauai?74-?' . ptfiri hup 



403426 vl 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: Ballinger, Marcus D . 

Jones, Jennifer T. 
Fairbrother, Wayne J. 
Sliwkowski, Mark X. 
10 Wells, James A. 

Genentech, Inc. 

(ii) TITLE OF INVENTION: HEREGULIN VARIANTS 

15 (iii) NUMBER OF SEQUENCES: 92 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Skjerven, Morrill, MacPherson, et al . 

(B) STREET: 25 Metro Drive, Suite 700 
2 0 (C) CITY: San Jose 

(D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 95110 

2 5 (v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: Pending 

(B) FILING DATE: 10-FEB-1998 

3 5 (vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/799,054 

(B) FILING DATE: 10-FEB-1997 

(C) CLASSIFICATION: 

4 0 (viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Haliday, Emily M. 

(B) REGISTRATION NUMBER: 38,903 

^ (C) REFERENCE /DOCKET NUMBER: M- 4 761 -IP PCT 

4 5 (ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (408) 453-9200 

(B) TELEFAX: (408) 453-7979 



5 0 (2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 
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25 



30 



40 



(C) TTRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
15 1 5 10 15 

Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 
20 25 30 

2 0 Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr 
35 40 45 

Val Met Ala Ser Phe Tyr Lys His Leu Gly lie Glu Phe Met Glu Ala 
50 S5 - 60 



Glu Glu Leu Tyr Gin Lys Arg 
65 70 

(2) INFORMATION FOR SEQ ID NO : 2 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
3 5 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



4 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
15 10 15 

50 Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 
20 25 30 

Leu Cys Lys Cys Gin Pro Gly Phe Thr Gly Ala Arg Cys Thr Glu Asn 
35 40 45 
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Val Pro Met Lys Val Gin Asn Gin Glu Lys Ala Glu Glu Leu Tyr Gin 
50 55 60 

5 Lys Arg 
65 

(2) INFORMATION FOR SEQ ID NO : 3 : 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

15 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
15 10 15 



Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 
30 20 25 30 

Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr 
35 40 45 

3 5 Val Met Ala Ser Phe Tyr Lys Ala Glu Glu Leu Tyr Gin Lys Arg 

50 55 60 

(2) INFORMATION FOR SEQ ID NO : 4 : 

4 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 5 amino acids 

(B) TYPE: amino acid 

, ^(C) STRANDEDNESS: not relevant 

'(D) TOPOLOGY: not relevant 

45 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
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Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
x 5 10 15 

Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 
20 25 30 

Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr 
35 40 45 

Val Met Ala Ser Phe Tyr Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro 
50 55 60 



Glu 
15 65 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 
2 0 (A) LENGTH: 6 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY:' not relevant 

2 5 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE : 

3 0 (A) ORGANISM: Rattus rattus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

3 5 Ser His Leu He Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
1 ' 5 10 15 

Gly Gly Glu Cys Phe Thr Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 
20 25 30 



40 



Leu Cys Lys Cys Gin Pro Gly Phe Thr Gly Ala Arg Cys Thr Glu Asn 
3 5 4 0 4 5 



Val Pro Met Lys Val Gin Thr Gin Glu Lys Ala Glu Glu Leu Tyr Gin 
45 50 55 60 

Lys Arg 
65 

5 0 (2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rattua rattus 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 



Ser His Leu lie Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
15 1 5 10 15 

Gly Gly Glu Cys Phe Thr Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 
20 25 30 

2 0 Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr 
35 40 45 



25 



30 



Val Met Ala Ser Phe Tyr Lys His Leu Gly lie Glu Phe Met Glu Ala 
50 5*5 * 60 

Glu Glu Leu Tyr Gin Lys Arg 
65 70 

(2) INFORMATION FOR SEQ ID NO : 7 : 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
3 5 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 



40 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rattus rattus 



4 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Ser His Leu lie Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
1 5 10 15 

5 0 Gly Gly Glu Cys Phe Thr Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 

20 25 30 

Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr 
35 40 45 
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Val Met Ala Ser Phe Tyr Lys Ala Glu Glu Leu Tyr Gin Lys Arg 
50 55 60 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rattus rattus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Ser His Leu lie Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
1 5 ' 1 10 15 

Gly Gly Glu Cys Phe Thr Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 
20 25 30 

Leu Cys Lys Cys Gin Pro Gly Phe Thr Gly Ala Arg Cys Thr Glu Asn 
35 40 45 

Val Pro Met Phe Tyr Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro Glu 

50 55 60 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

' (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rattus rattus 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
Ser His Leu lie Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
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1 5 

Gly Gly Glu Cys Phe Thr Val Lys 
20 

Leu Cys Lys Cys Gin Pro Gly Phe 
35 40 

Val Pro Met Phe Tyr Ser Met Thr 
50 55 

Glu Lys Pro Leu Glu Arg Lys Leu 
65 70 

Lys 



10 is 

Asp Leu Ser Asn Pro Ser Arg Tyr 
2 5 30 

Thr Gly Ala Arg Cys Thr Glu Asn 
45 

Ser Arg Arg Lys Arg Gin Glu Thr 
60 

Phe His Ser Leu Val Lys Glu Ser 
75 80 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : not relevant 
(D) TOPOLOGY: not. relevant 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
15 io 15 

Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 
2 0 2 5 3 0 

Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr 
3 5 4 0 4 5 

Val Met Ala Ser Phe Tyr Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro 
50 55 60 

Glu 
65 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 5 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

5 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL : NO 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 

15 Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
15 10 15 



20 



45 



50 



Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 
20 25 30 

Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr 
35 40 45 



Val Met Ala Ser Phe Tyr Ser. Thr Ser Thr Pro Phe Leu Ser Leu Pro 
25 50 55 60 

Glu 
65 

3 0 (2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 5 amino acids 

(B) TYPE: amino acid 

3 5 (C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

4 0 (iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 

15 10 15 

Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 
20 25 30 

Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr 
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35 40 45 

Val Met Ala Ser Phe Tyr Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro 
50 55 60 

Glu 
65 

(2) INFORMATION FOR SEQ ID NO:13: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B ) TYPE: amino acid 

(C) S TRANDEDNE S S : not relevant 
15 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gallus domesticus 



2 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Ser His Leu Thr Lys Cys Asp lie Lys Gin Lys Ala Phe Cys Val Asn 
15 10 15 

3 0 Gly Gly Glu Cys Tyr Met Val Lys Asp Leu Pro Asn Pro Pro Arg Tyr 

20 25 30 



Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr 

35 40 45 

Val Met Ala Ser Phe Tyr Lys His Leu Gly lie Glu Phe Met Glu Ala 
50 55 60 



Glu Glu Leu Tyr Gin Lys Arg 
40 65 70 

(2) INFORMATION FOR SEQ ID NO : 14 : 

4 (i) ^SEQUENCE CHARACTERISTICS: 

4 5 (A) LENGTH: 4 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

5 0 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE : 
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(A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
1 5 10 15 



Gly Gly Glu Cys Phe Met Val Lys Asp Pro Ser Arg Tyr Leu Cys Lys 
10 20 25 30 

Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val Met Ala 
35 40 45 

15 Ser 



(2) INFORMATION FOR SEQ ID NO: 15: 

2 0 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: n6t .relevant 

25 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp Gly Tyr Cys Leu His 
15 10 15 



Asp Gly Val Cys Met Tyr lie Glu Ala Leu Asp Lys Tyr Ala Cys Asn 

40 20 25 30 

Cys Val Val Gly Tyr lie Gly Glu Arg Cys Gin Tyr Arg Asp Leu Arg 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 amino acids 
5 0 (B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 
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(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO : 16 : 

10 Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
15 10 15 

Gly Gly Glu Cys Phe Met Val Lys Asp Pro Ser Arg Tyr Leu Cys Lys 
20 25 30 

Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val lie Ala 
35 40 45 

Ser 

20 

(2) INFORMATION FOR SEQ ID NO: 17: 

t 

(i) SEQUENCE CHARACTERISTICS : 

2 5 (A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

3 0 <ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

3 5 (A) ORGANISM: Not relevant (recombinant) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 17 : 

4 0 Trp Glu Leu Val Pro Cys Gly Trp Asp Arg Glu Gly Phe Cys Val Asn 

15 10 15 

Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 
, , 20 25 30 

45 

Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr 
35 40 45 

Val lie Ala Ser 
50 50 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

-108- 



:i nils P "7 -I- 7 ... -31 1 ill o » 



403426 vl 



(A) LENGTH: 4 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

5 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



15 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Trp Glu Leu Val Pro Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
1 5 10 15 ' 



Gly Gly Glu Cys Tyr Lys Val Arg lie Tyr Gly Tyr Leu Met Cys Lys 
20 20 25 30 

Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val lie Ala 
35 • 40 45 

25 Ser 



(2) INFORMATION FOR SEQ ID NO: 19: 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

35 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

4 0. (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



* (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Trp Glu Leu Val Pro Cys Gly Trp Asp Arg Glu Gly Phe Cys Val Asn 
15 10 15 



Gly Gly Glu Cys Tyr Lys Val Arg lie Tyr Gly Tyr Leu Met Cys Lys 

50 20 25 30 

Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val lie Ala 
35 40 45 
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Ser 



(2) INFORMATION FOR SEQ ID NO: 20: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
10 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



2 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Trp Glu Leu Val Pro Cys Gly Trp Asp Arg Glu Gly Phe Cys Val Asn 
1 5 ' 10 15 

2 5 Gly Gly Glu Cys Tyr Lys Val Arg He Tyr Arg Tyr Arg Met Cys Lys 
20 25 ' 30 

Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val He Ala 
35 40 45 



Ser 



(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
4 0 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

> (iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
1. 5 10 15 
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Gly Gly Glu Cys Phe Met Val Lys Asp Tyr Gly Tyr Leu Met Cys Lys 
20 25 30 

Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val lie Ala 
5 35 40 45 

Ser 



10 (2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

15 (C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

2 0 (iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM:, Not relevant (recombinant) 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Ser His Leu Val Lys Cys Gly Glu Glu Arg Glu Gly Phe Cys Val Asn 
15 10 15 

30 

Gly Gly Glu Cys Tyr Arg Val Lys Thr Leu Ser Asn Pro Ser Arg Tyr 
20 25 30 

Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr 
35 35 40 45 

Val Met Ala Ser 
50 

40 (2) INFORMATION FOR SEQ ID NO:23: 

( i ) S EQUENCE CHARACTER I S T I CS : 

(A) LENGTH: 4 9 amino acids 
* JLB) TYPE: amino acid 

45 (C) STRANDEDNESS ■ not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

50 (iii) HYPOTHETICAL. NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Ser His Leu Val Lys Cys Gly Glu Glu Arg Glu Gly Phe Cys Val Asn 
5 1 5 10 15 

Gly Gly Glu Cys Phe Met Val Lys Asp Tyr Gly Tyr Leu Met Cys Lys 
20 25 30 

10 Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val Met Ala 
35 40 45 



15 



25 



30 



45 



Ser 



(2) INFORMATION FOR SEQ ID NO:24: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 amino acids 
2 0 (B) TYPE: amino acid 

(C) STRANDEDNE SS : not relevant 

(D) TOPOLOGY : not relevant 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



Ser His Leu Val Lys Cys Gly Glu Glu Arg Glu Gly Phe Cys Val Asn 
35 1 5 10 15 

Gly Gly Glu Cys Tyr Arg Val Lys Thr Tyr Gly Tyr Leu Met Cys Lys 
20 25 30 

4 0 Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val Met Ala 
35 40 45 



Ser 

(2) INFORMATION FOR SEQ ID NO: 25: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 
5 0 <B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 
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(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL, SOURCE: 

(A) ORGANISM : Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

10 Ser His Leu Val Lys Cys Gly Glu Glu Arg Glu Gly Phe Cys Val Asn 
15 10 15 

Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 
20 25 30 

Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr 
35 40 45 



Val He Ala Ser 
20 50 

(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

2 5 (A) LENGTH: 4 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

3 0 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Not relevant (recombinant) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

4 0 Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 

15 10 15 



Gly Gly Glu Cys Tyr Arg Val Lys Thr Tyr Gly Tyr Leu Met Cys Lys 
4 * 20 25 30 

Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val Met Ala 
35 40 45 

Ser 



(2) INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 4 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

5 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
15 10 15 

2 0 Gly Gly Glu Cys Tyr Arg Val Lys Thr Tyr Gly Tyr Leu Met Cys Lys 
20 25 30 

Cys Pro Asn Glu Phe Thir Gly Asp Arg Cys Gin Asn Tyr Val lie Ala 
35 • 40 45 



25 



30 



40 



45 



50 



Ser 



(2) INFORMATION FOR SEQ ID NO:28: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
3 5 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
15 10 15 

Gly Gly Glu Cys Tyr Arg Val Lys Thr Leu Ser Asn Pro Ser Arg Tyr 
20 25 30 



Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr 
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35 40 45 

Val lie Ala Ser 
50 

5 

(2) INFORMATION FOR SEQ ID NO : 2 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 
10 (B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 



15 



20 



30 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO : 2 9 : 

2 5 Ser His Leu Val Lys Cys Gly Glu Glu Arg Glu Gly Phe Cys Val Asn 
1 5 10 ' 15 

Gly Gly Glu Cys Tyr Arg Val Lys Thr Leu Ser Asn Pro Ser Arg Tyr 
20 25 30 



Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr 
35 40 45 



Val lie Ala Ser 
35 50 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 4 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

4 5 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
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Ser His Leu Val Lys Cys Gly Glu Glu Arg Glu Gly Phe Cys Val Asn 
1 5 10 15 

5 Gly Gly Glu Cys Phe Met Val Lys Asp Tyr Gly Tyr Leu Met Cys Lys 
20 25 30 

Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val lie Ala 
35 40 45 



Ser 



(2) INFORMATION FOR SEQ ID NO: 31: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
2 0 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 
i 

(iii) HYPOTHETICAL: NO' . 



<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Ser His Leu Val Lys Cys Gly Glu Glu Arg Glu Gly Phe Cys Val Asn 
15 10 15 

35 

Gly Gly Glu Cys Tyr Arg Val Lys Thr Tyr Gly Tyr Leu Met Cys Lys 
20 25 30 

Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val lie Ala 
40 35 40 45 

Ser 

4 5 (2) INFORMATION FOR SEQ ID NO: 32. 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 amino acids 

(B) TYPE: amino acid 

50 (C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 
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(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

10 Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
15 10 15 

Gly Gly Glu Cys Tyr Arg Val Lys Thr Tyr Gly Tyr Leu Met Cys Lys 
20 25 30 



Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin His Tyr Val lie Ala 
35 40 45 

Ser 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

2 5 (A) LENGTH: 4 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

3 0 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

Ser His Leu Val Lys Cys Gly Glu Glu Arg Glu Gly Phe Cys Val Asn 
1 5 10 15 



6ly Gly Glu Cys Tyr Arg Val Lys Thr Tyr Gly Tyr Leu Met Cys Lys 
45 20 25 30 

Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin His Tyr Val lie Ala 
35 40 45 

50 Ser 



(2) INFORMATION FOR SEQ ID NO : 34 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 4 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

<vi) ORIGINAL SOURCE: 

• (A) ORGANISM: Not relevant (recombinant) 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 



Gly Gly Gly Ser 
1 

(2) INFORMATION FOR SEQ ID NO : 3 5 : 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
25 (B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

4 0 Gly Gly Gly Ser Gly Gly Gly 

1 5 

(2) INFORMATION FOR SEQ ID NO:36: 

4 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 36: 

Thr Arg Asp Lys Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 37: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 5 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

3 0 Asp Asp Asp Asp Lys 

1 5 

(2) INFORMATION FOR SEQ ID NO: 38: 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

40 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

4 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

5 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Ser His Leu Val Lys 
1 5 
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(2) INFORMATION FOR SEQ ID NO : 3 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 9 : 

2 0 Trp Arg Leu Val Pro 

1 5 

(2) INFORMATION FOR SEQ ID NO: 40: 

2 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

30 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

3 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM : Not relevant (recombinant) 



4 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Trp Ser Leu Gin Pro 
1 5 

4 5 (2)' INFORMATION FOR SEQ ID NO: 41: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

50 (C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 
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(iii) HYPOTHETICAL : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 1 : 

10 Trp Glu Leu Val Pro 

1 5 

(2) INFORMATION FOR SEQ ID NO: 42: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : not relevant 
(D) TOPOLOGY: not relevant 

20 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: 'NO 

2 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

Trp Ser Leu Val Lys 
1 5 

3 5 (2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

4 0 (C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

4 5 (iii) HYPOTHETICAL: NO- 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
Trp Ser Leu lie Pro 
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1 5 

(2) INFORMATION FOR SEQ ID NO:44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

Trp Arg Leu Val Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids . 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 5 : 

' Trp -Ala Leu Val Pro 

1 5 

(2) INFORMATION FOR SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 
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(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Trp Ser Leu Gin Lys 

1 5 

15 (2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS :♦ 

(A) LENGTH : 5 amino acids 

(B) TYPE: amino acid 

2 0 (C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

25 (iii) HYPOTHETICAL: NO ' ' 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 

Trp Glu Leu Val Ala 
35 1 5 

(2) INFORMATION FOR SEQ ID NO:48: 

(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

4 5 ' (ii) MbLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 
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Trp Ser Leu Glu Pro 
1 5 

5 (2) INFORMATION FOR SEQ ID NO : 4 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

15 (iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



20 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 9 : 



Ala Glu Lys Glu Lys Thr 
25 1 5 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

3 5 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

4 0 (A) ORGANISM: Not relevant (recombinant) 



* (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Gly Val Gly Arg Asp Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
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(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 



Gly Gly Glu Arg Glu Gly 
15 1 5 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS : 
2 0 (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: 'not relevant 

2 5 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

3 0 (A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Gly Glu Glu Arg Glu Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 53: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

' (C) STRANDEDNESS: not relevant 

4 5 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Not relevant (recombinant) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 

Gly Trp Asp Arg Glu Gly 
5 1 5 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

15 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 
2 0 (A) ORGANISM: Not relevant (recombinant) 



25 



30 



40 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Gly Val Gin Arg Glu Gly 

1 1 5 

(2) INFORMATION FOR SEQ ID NO: 55: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
3 5 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Gly Glu Glu Arg Ala Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 6 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

5 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Not relevant (recombinant) 



15 



20 



30 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 56: 

Gly Lys Glu Arg Glu Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 57: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: .not relevant 
2 5 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



35 



40 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Thr Asn Ser Arg Glu Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO:58: 



(i) SEQUENCE CHARACTERISTICS: 
# ^(A) LENGTH: 6 amino acids 

4 5 ^ (B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Asp Lys Ser Arg Glu Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 59: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
15 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 



Gly Glu Asp Arg Lys Gin 
1 5 

(2) INFORMATION FOR SEQ ID NO: 60: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
3 5 (B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

- (A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 



5 0 Gly Arg Glu Arg Glu Gly 

1 5 

(2) INFORMATION FOR SEQ ID NO: 61: 
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10 



15 



20 



30 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Val Asn Gly Gly Glu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 62 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
2 5 (B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

4 0 Val Asn Gly Gly Glu 

1 5 

(2) INFORMATION FOR SEQ ID NO: 63: 

4 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino ccid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

50 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



10 



20 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Val Asn Gly Gly Val 
1 5 

(2) INFORMATION FOR SEQ ID NO: 64: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: I^ot relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

3 0 Val Asn Gly Gly Gin 

1 5 

(2) INFORMATION FOR SEQ ID NO: 65: 

3 5 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

40 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

4 5 (vi) ORIGINAL SOURCE : 

(A) ORGANISM: Homo sapiens 



50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

Phe Met Val Lys Asp 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 

2 0 Tyr Lys Val Arg lie 

1 5 

(2) INFORMATION FOR SEQ ID NO: 67: 

2 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

30 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

3 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



4 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Phe Arg Val Lys Thr 
1 5 

4 5 (2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

50 (C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 
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(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

10 Tyr Arg Val Lys Thr 

1 5 

(2) INFORMATION FOR SEQ ID NO: 69: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

20 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: ,NO 

2 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

Tyr Met lie Lys Tyr 
1 5 

3 5 (2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

4 0 (C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

45 (iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
Tyr Met Val Lys Thr 
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1 5 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



2 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Met Arg Val Arg Thr 
1 5 • 

2 5 (2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

3 0 (C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 
3 5 (iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72: 

' Pro Ser Arg Tyr Leu 

45 1 5 

(2) INFORMATION FOR SEQ ID NO:73: 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 
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(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Thr Pro Tyr Leu Met 
1 5 

15 (2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

20 (C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE- peptide 

2 5 (iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Not relevant (recombinant) 

3 0 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 74 : 

Tyr Gly Tyr Leu Met 
35 1 5 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

4 0 (A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

4 5 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
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Tyr Arg Tyr Arg Met 
1 5 

5 (2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

15 (iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



20 



45 



50 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 



Thr His Tyr Arg Gly t 
25 l 5 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

3 5 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Not relevant (recombinant) 



# (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

*fhr His Tyr Arg Met 
1 5 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
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(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78: 



Tyr Lys Tyr Arg Met 
15 1 5 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 
2 0 (A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

2 5 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

3 0 (A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Thr Lys Tyr Arg Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 80: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 5 amino acids 

(B) TYPE: amino acid 

^ (C) STRANDEDNESS : not relevant 

4 5 '(D) TOPOLOGY: not relevant 

4 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

50 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Tyr Lys Tyr Arg Leu 
5 1 5 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

15 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHET I CAL : NO 

(vi) ORIGINAL SOURCE: 
2 0 (A) ORGANISM: Homo sapiens 



25 



30 



40 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Lys Cys Pro Asn Glu Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO: 82: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
3 5 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:82: 

Arg Cys Ser Leu Glu Phe 
1 5 



(2) INFORMATION FOR SEQ ID NO:83: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

5 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Not relevant (recombinant) 



15 



20 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

Arg Cys Ser Glu Glu Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO: 84: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amiVno acid 

(C) STRANDEDNESS : not relevant 
25 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



35 



40 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

Lys Cys Pro Lys Glu Met 
1 5 

(2) INFORMATION FOR SEQ ID NO: 85: 



(i) SEQUENCE CHARACTERISTICS: 
, ^(A) LENGTH: 6 amino acids 

4 5 (B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

Arg Cys Thr Val Glu Tyr 
1 5 

(2) INFORMATION FOR SEQ ID NO:86: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
<B) TYPE: amino acid 
(C) STRANDEDNESS : not relevant 
15 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 



Arg Cys Thr Val Glu Tyr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 87: 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 6 amino acids 
3 5 (B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 



5 0 Lys Cys Asn Ser Glu Phe 

1 5 

(2) INFORMATION FOR SEQ ID NO: 88: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:88: 



Arg Cys Lys Lys Glu Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO: 89: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: ^ amino acids 
2 5 (B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

4 0 Gin Asn Tyr Val Met 

1 5 

(2) INFORMATION FOR SEQ ID NO: 90: 

4 5 ' (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Gin Trp Tyr Val lie 
1 5 

(2) INFORMATION FOR SEQ ID NO: 91: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPL : peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Not relevant (recombinant) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

30 Gin His Tyr Val lie 

1 5 

(2) INFORMATION FOR SEQ ID NO: 92: 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 52 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: not relevant 
(D) TOPOLOGY: not relevant 

40 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETI CAL : NO 

4 5 ' (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

5 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 92 : 

Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
15 10 15 
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Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 
20 25 30 

Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr 
35 40 45 

Val Met Ala Ser 
50 
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